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BACKGROUND OF THE INVENTION 



1 . Field of the Invention 

20 The present invention relates generally to data redundancy methods and 

apparatus. Various aspects relate more particularly to redundancy data 
generation, data restoration, data storage, redundancy adjustability, data 
communication, computer network operations, and code discovery techniques. 

25 2. Description of the Related Art 

With the explosive growth in the Internet and mission-critical 
applications, the importance of preserving data integrity and ensuring 24x7 
continuous access to critical information cannot be overstated. Information is 
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now recognized as a key organizational asset, essential to its operation and 
market competitiveness. Access to critical information on a continuous basis 
is a mandatory requirement for survival in the business world. Critical 
applications involving military operations, communications, audio-visual, 
5 medical diagnoses, ISP (Internet Service Provider) and Web sites, or financial 
activities, for example, depend upon the continuous availability of essential 
data. 

Downtime is extremely costly. Customers, vendors, employees, and 
prospects can no longer conduct essential business or critical operations. 

10 There is a "lost opportunity" cost to storage failures as well in terms of 
business lost to competitors. Well-documented studies place the cost of 
downtime in the tens of thousands (or even millions) of dollars per hour. 

The need for large amounts of reliable online storage is fueling demand 
for fault-tolerant technology. According to International Data Corporation, the 

15 market for disk storage systems last year grew by 12 percent, topping $27 

billion. More telling than that figure, however, is the growth in capacity being 
shipped, which grew 103 percent in 1998. Much of this explosive growth can 
be attributed to the space-eating demands of endeavors such as year 2000 
testing, installation of data-heavy enterprise resource planning applications 

20 and the deployment of widespread Internet access. 

Disk drive manufacturers publish Mean Time Between Failure (MTBF) 
figures as high as 800,000 hours (91 years). However, the claims are mostly 
unrealistic when examined. The actual practical life of a disk drive is 5 to 7 
years of continuous use. Many Information Technology managers are aware 

25 that disk drives fail with great frequency. This is the most likely reason why 

companies place emphasis on periodic storage backup, and why there is such a 
large market for tape systems. 

The industry answer to help satisfy these needs has been the use of 
conventional RAID ("Redundant Arrays of Inexpensive Disks") storage. In 
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general, RAID storage reduces the risk of data loss by either replicating critical 
information on separate disk drives, or spreading it over several drives with a 
means of reconstructing information if a single drive is lost. 

There are basically four elements of RAID: 1) mirroring data (i.e., creating 
5 an exact copy every time information is written to storage), 2) performing 
checksum calculations {parity data), 3) striping information in equal-sized 
pieces across multiple drives, and 4) having a standby hot spare should one 
drive fail. Some methods use a combination of both approaches. RAID storage 
systems are usually designed with redundant power supplies and the ability to 
10 swap out failed drives, power supplies and fans while the system continues to 
operate. Sophisticated RAID systems even contain redundant controllers to 
share the workload and provide automatic fail-over capabilities should one 
malfunction. 

Conventional RAID storage configurations have proven to be the best 
15 hedge against the possibility of a single drive failure within an array. If more 
than one drive in a RAID array fails, however, or a service person accidentally 
removes the wrong drive when attempting to replace a failed drive, the entire 
RAID storage system becomes inoperable. And the likelihood of multiple drive 
failures in large disk arrays is significant. The resultant cost of inaccessibility 
20 to mission-critical information can be devastating in terms of lost opportunity, 
lost productivity and lost customers. 

Accidents can contribute to multiple drive failures in RAID storage. 
Service personnel have been known to remove the wrong drive during a 
replacement operation, crashing an entire RAID storage system. In poorly 
25 engineered RAID systems, replacing a failed drive can sometimes create a 

power glitch, damaging other drives. General data center administrative and 
service operations also present opportunities for personnel to inadvertently 
disable a drive. 
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It is well-known that the likelihood of a drive failure increases as more 
drives are added to a disk RAID storage system. The larger the RAID storage 
system (i.e., the more disk drives it has), the greater the chance that two or 
more drives could become inoperable at one time. Here, the term "time" means 
5 the duration from the instant when a drive fails until it is replaced and 

data/ parity information is recovered. In remote locations, during holidays, or 
even during graveyard shifts, the "time" to drive recovery could be several 
hours. Thus, multiple drive failures do not have to occur at exactly the same 
instant in order to have a devastating effect on mission-critical storage. 

10 Given the plausible assumptions that drives fail independently at 

random times with a certain MTBF, and that they stay down a certain time 
after failing, the following conclusions may be drawn for large arrays of disks: 
(1) the frequency of single drive failure increases linearly as the number of 
disks n; (2) the frequency of two drives failing together (a second failing before 

15 the first is reconstructed) increases as n*(n-l), or almost as the square of the 
number of disks; (3) the frequency of three drives failing together increases as 
n(n-l)(n-2) or almost as the cube; and so forth. 

The multiple failures, though still less frequent than single disk failure, 
become rapidly more important as the number of disks in a RAID becomes 

20 large. The following table illustrates the behavior of one, two and three drive 
failure MTBFs given that single drive MTBF divided by downtime is very much 
greater than the number of drives: 



# of 
Drives 


1 


2 


3 


4 
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15 


20 


MTBF 
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b/3 
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b/45 
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c/10 


c/120 


c/455 
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Here a << b << c are mean time constants for a failure of one disk, a 
coincidental failure of two disks, and a coincidental failure of three disks, 
respectively. If one-disk MTBF is five 360-day years and downtime is one day, 
5 then a = 5 years, b = 4,500 years, and c = 5,400,000 years. If MTBF is reduced 
to 1 year and downtime increased to two days, then a = 1 year, b = 90 years, 
and c = 10,800 years. 

The consequences of a multiple-drive failure can be devastating. 
Typically, if more than one drive fails, or a service person accidentally removes 
10 the wrong drive when attempting to replace a failed drive, the entire RAID 
storage system is out of commission. Access to critical information is not 
possible until the RAID system is re-configured, tested and a backup copy 
restored. Transactions and information written since the last backup may be 
lost forever. 

15 Thus, the possibility of a multiple-drive failure is very high for mission- 

critical applications that run 24-hours daily on a continuous basis. Moreover, 
the larger a RAID storage system, the greater the potential of suffering 
multiple-drive failures. And the chances increase significantly for remote 
locations where the response time to replace a failed drive can extend to several 

20 hours or even days. 

Conventional RAID levels have their advantages and disadvantages. 
While RAID-0 delivers high performance, it cannot sustain even a single drive 
failure because there is no parity information or data redundancy. Although 
the most costly, mirroring data on separate drives (RAID-1), means that if one 

25 drive fails, critical information can still be accessed from the mirrored drive. 
Typically, RAID-1 involves replicating all data on two separate "stacks" of disk 
drives on separate SCSI channels, incurring the cost of twice as many disk 
drives. There is a performance impact as well, since data must be written 
twice, consuming both RAID system and possibly server resources. RAID-3 
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and RAID-5 allow continued (albeit degraded) operation by reconstructing lost 
information "on the fly" through parity checksum calculations. Adding a global 
hot spare provides the ability to perform a background rebuild of lost data. 
With the exception of costly RAID- 1 (or combinations of RAID- 1 with 
5 RAID-0 or RAID-5) configurations, there have been few solutions for recovering 
from a multiple drive failure within a RAID storage system. Even the 
exceptions sustain multiple drive failures only under very limited 
circumstances. For example, a RAID-1 configuration can lose multiple (or all) 
drives in one mirrored stack as long as not more than one drive fails in its 
10 mirrored partner. Combining striping and parity within mirrored stacks buys 
some additional capabilities, but is still subject to these drive-failure 
limitations. 

Some variations of RAID are based merely on combinations of RAID 
levels, described below in terms of basic structure and performance (0+1 array, 

15 5+1 array, and 5+5 array). All of the packs described in the following 

configurations are assumed to have pattern designs that maximize read and 
write speed for large files and parallel data flows to and from disks. The "ideal" 
speeds will be based on raw data movement only, ignoring buffering and 
computational burdens: In a "0+1" array, two striped arrays of five disks each 

20 mirror the other. A striped array (RAID-0) is lost if only one of its disks is lost, 
so the safe loss count = 1 and maximum loss count = 5 (depending whether 
disks lost are on same side of mirror). Data capacity = 5, read speed =10 
(using an operating system capable of alternating mirror reads to achieve full 
parallelism; the usual max is 5) and write speed = 5 (here reading assumes a 

25 strategy of alternating between sides of the mirror to increase the parallelism). 
In a "5+1 " array, two RAID-5 arrays of five disks each mirror each other. Safe 
loss count is 3 (when one side has lost no more than one disk, the other 
perhaps more, we can still recover), max loss count is 6 (one entire side, and 
one disk from the other side). Data capacity is 4 (equals that of one RAID-5 
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array), read speed =10 but usual max is 5 (see above discussion of a 0+l"), and 
write speed = 4 (using full parallelism, but with parity and mirror burdens). 
Similar results arise from a 1+5 array (a RAID-5 made of mirrored pairs). In a 
"5+5" array, three RAID-5 arrays of three disks each form a RAID-5 with 
5 respect to each other. Thus one entire array of three can be lost, plus one disk 
of each of the other two. This implies safe loss count = 3 (it can't tolerate a 0 - 
2-2 loss pattern) and max loss count - 5. Data capacity is 4 (of 9), read speed 
is 9 (using nested striping) and write speed is 4. 

Other RAID-like variations exist, but with their downsides. A highly 

10 complex encryption-type multiple redundancy algorithm exists, referred to as 
the Mariani algorithm (downloaded file "raidzz" and related files). The form of 
RAID described by Mariani can either be applied to dedicated parity disks or 
have rotation superimposed (as with the two patents referred to below), and 
additionally requires encryption, which does not treat every bit in a chunk 

15 identically. In addition, the subject matter in U.S. Patent Number 5,271,012 
("Method and Means for Encoding and Rebuilding Data Contents of up to Two 
Unavailable DASDs in an Array of DASDS" and in U.S. Patent Number 
5,333,143 ("Method and Means for B-Adjacent Coding and Rebuilding Data 
from up to Two Unavailable DASDs in a DASD Array*) address multiple 

20 failures, but are limited. The form of RAID described in these patents 

generates two parity stripes as a function of n-2 data stripes. The two parity 
stripes (on two of the disks) are all parity; the n-2 data stripes (on n-2 of the 
disks) are all data. This leads to read inefficiency unless a rotation structure is 
superimposed on the formula, in which case it leads to algorithmic inefficiency. 

25 Accordingly, what is needed are methods and apparatus that overcome 

these and other deficiencies of the prior art. 
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A data storage apparatus has a plurality of n disks and data comprising 
a plurality of n data groupings stored across the plurality of n disks. Each one 
5 of the n data groupings comprise a data portion and a redundancy portion. 
Advantageously, the n data portions are recoverable from any and all 
combinations of n-m data grouping(s) on n-m disk(s) when the other m data 
grouping(s) are unavailable, where 1 < m < n. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is an illustration of a disk array configured in accordance with 
principles of the invention, where n is the number of disks and m is the 
number of failures allowed. 
15 FIG. 2 is an example of a conventional RAID configuration. 

FIG. 3 is an example of an inventive configuration comparable to the 
conventional RAID configuration of FIG. 2. 

FIG. 4 is an illustration of multiple failure conditions of a storage system 
from which data is recoverable. 
20 FIG. 5 is an illustration of multiple failure conditions of a storage system 

from which data is recoverable. 

FIG. 6 is a diagram of hardware which may embody principles of the 
present invention. 

FIG. 7 is an illustration of various software components which may 
25 embody principles of the present invention. 

FIG. 8 is an example of a disk array where m (the number of disks) is five 
and n (the maximum number of failures allowed) is two. 

FIG. 9 shows an exemplary redundancy data generation matrix and 
functions for m=2 and n=5. 
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FIG. 10 shows an exemplary data recovery matrix and functions for m=2 
and n=5, for data on disks A and B being unavailable. 

FIG. 1 1 shows an exemplary data recovery matrix and functions for m=2 
and n=5, for data on disks A and C being unavailable. 
5 FIG. 12 shows an exemplary data recovery matrix and functions for m=2 

and n=5, for data on disks A and D being unavailable. 

FIG. 13 shows an exemplary data recovery matrix and functions for m=2 
and n=5, for data on disks A and D being unavailable. 

FIG. 14 is a diagram of a computer system having a redundancy data 
10 generator and data recovery component. 

FIG. 15 is a schematic block diagram of a storage controller operative in 
connection with a disk array. 

FIG. 16.1 is a schematic block diagram of an adjustable redundancy and 
recovery apparatus. 
15 FIG. 16.2 is a flowchart describing a method of providing data 

redundancy adjustments. 

FIG. 17. 1 is a flowchart describing a method of storing data on the disk 

array. 

FIG. 17.2 is a flowchart describing a method of generating redundancy 

20 data. 

FIG. 18. 1 is a flowchart describing a method of providing data from the 
disk array. 

FIG. 18.2 is a flowchart describing a method of recovering data. 
FIG. 19 is a block diagram of a communication system, which here 
25 provides for radio frequency (RF) communication. 

FIG. 20 is a schematic block diagram of a communication device of FIG. 

19. 

FIG. 2 1 is a timing/data diagram of the communication system of FIGs. 
19 and 20. 
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FIG. 22 is the timing/ data diagram of FIG. 21, where some data portions 
are unavailable. 

FIG. 23 is a timing/data diagram of the communication system of FIGs. 
19 and 20. 

5 FIG. 24.1 is the timing/data diagram of FIG. 23, where some data 

portions are unavailable. 

FIG. 24.2 is a flowchart describing a method of communicating data in 
the communication system of FIGs. 19 and 20. 

FIG. 25 is a diagram of a local area network (LAN) in communication 
10 with a switch controller. 

FIG. 26 is a diagram of a wide area network (WAN) in communication 
with telephone carriers. 

FIG. 27 is a computer network having multiple servers. 

FIG. 28 is a first table showing data reception times of the computer 
15 network of FIG. 27. 

FIG. 29 is a second table showing data reception times of the computer 
network of FIG. 27. 

FIG. 30 is a third table showing data reception times of the computer 
network of FIG. 27. 

20 FIG. 31 is a flowchart describing a method of data communication of the 

computer network of FIG. 27. 

FIG. 32 is a flowchart describing a method of determining and/ or 
verifying whether a candidate bit matrix is a Wiencko bit matrix. 
FIG. 33 is an example of a Wiencko bit matrix. 
25 FIG. 34 is an example of defining subarrays and composite submatrices 

from a bit matrix where m=2 and n=5. 

FIG. 35 is an example of defining subarrays and composite submatrices 
from a bit matrix where m=3 and n=5. 
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FIG. 36 is an example of defining subarrays and composite submatrices 
from a bit matrix where m=4 and n=5. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

5 

Extensive research and development has resulted in software algorithms 
that may be used to augment RAID storage technology by performing 
automatic, transparent recovery from multiple drive failures without 
interrupting ongoing operations. The inventive techniques extend RAID 

10 functionality in ways that allow for instantaneous data recovery in the event of 
multiple simultaneous or near simultaneous disk failures in a disk array. 

In accordance with one inventive aspect, a data storage apparatus has a 
plurality of n disks and data comprising a plurality of n data groupings stored 
in the plurality of n disks respectively. Each one of the n data groupings 

15 comprise a data portion and a data redundancy portion. Advantageous^, the 
n data portions are constructible from any and all combinations of (n-m) data 
grouping(s) from (n-m) disk(s) when the other m data grouping(s) are 
unavailable, where 1 < m < n. 

Thus, in an n-drive system, continued operations are possible even in the 

20 event of any combination of up to m drive failures (where 1 < m < n). This adds 
a ground-breaking element to "extreme uptime" disk array technology, one that 
may find a natural home in many 24x7 operational environments. Moreover, 
because these algorithms have exceptionally fast computational speeds, 
storage transfer rate performance actually increases while adding virtually 

25 unlimited data protection. Compared to conventional RAID storage, the 

inventive storage system improves storage reliability while enhancing overall 
system performance. The functions are simple and may be easily stored in gate 
level logic or logic arrays. Preferably, minimal and "sparse* encoding functions 
are selected to minimize the amount of logic needed to encode the redundancy 
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information. The algorithms may also allow for users to select the degree of 
"disk-loss insurance" desired. 

As an example, an 18-disk array can be "coded" to allow three disks to be 
missing (n=18, m=3). With an "18/3" configuration, a total of fifteen disks 
5 worth of information would be stored on the 18-disk array. 1 / 18th of the 

original data is placed on each of the eighteen disks, and an additional twenty 
percent of coding information is included on each disk. Any three of the 
eighteen disks may fail and the full data can be reconstructed, in the same way 
that a single disk of an 18-disk RAID-5 array can fail and the data can be 

10 reconstructed. 

As described herein, many inventive aspects relate to providing useful 
and practical applications for a new family of codes, herein referred to as 
"Wiencko codes" (pronounced "WEN-SCO" codes). The codes may be applied in 
connection with simple exclusive-OR (XOR) logic. In data storage applications, 

15 this is a family of codes where n is the total number of disks in the array, and 
m is the number of disks that are allowed to be missing while still being able to 
fully recover the data. When n-m disks survive, the original data can be fully 
recovered. The efficiency is ideal, because the original quantity of "clear data" 
would occupy n-m disks. The inventive techniques may extend RAID 

20 functionality, while still retaining the advantages of traditional RAID 
techniques. 

In a more particularly described application, a disk array has n disks 
with a stripe set stored across the n disks. Each stripe of the stripe set has a 
data portion of H data bits and a redundancy portion of Q redundancy bits. A 
25 relationship exists between the data portions and the redundancy portions 

based on an n*H by n*Q bit matrix. The bit matrix is representible by an n by 
n array of H by Q bit submatrices, where n/m = (H/Q) + 1. The bit matrix also 
has a plurality of n!/(m!*(n-m)!) composite bit submatrices definable therefrom. 
Each such composite bit submatrix is more particularly definable from bit 
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submatrices at the intersection of a unique selection of m column(s) of the n by 
n array and a unique selection of (n-m) row(s) of the n by n array that 
correspond to those (n-m) column(s) not included in the unique selection of m 
column(s). Each one of these composite submatrices is invertible. The 
5 relationship between the data portions and the redundancy portions is such 
that each one of Q redundancy bits is the exclusive-OR of the n*H bits of the 
data portions and the n*H bits in the row of the bit matrix associated with such 
redundancy bit. 

10 Data Storage. FIG. 1 is an illustration of a plurality of n devices 100, of 

which up to m devices 102 may be unavailable and the user data is still 
recoverable. A number of stripe sets are spread across the n devices 100, such 
as a plurality of n stripe sets 106 and a plurality of n stripe sets 108. Each one 
of n devices 100 has a stripe which includes user data (indicated as u d n ) and 

15 redundancy data (indicated as V). The user data is in "clear" form and the 
redundancy data is generated based on Weincko redundancy generation 
functions. 

Referring now to FIG. 4, an illustration shows a number of different 
failure conditions 400 that a storage system can tolerate using one exemplary 

20 configuration of the present invention. The storage system has five disk drives 
(A through E), where any and all possible combinations of up to two drives may 
fail and the data is still recoverable. An "X" over a disk drive indicates a failure 
for that drive. A failure condition 402 shows that disk drives A and B have 
both failed; a failure condition 404 shows that disk drives A and C have both 

25 failed; a failure condition 406 shows that disk drives A and D have both failed; 
and so on for a failure condition 408 where disk drives D and E have both 
failed. For all of these failure conditions, the data is still recoverable from the 
storage system. These failure conditions are more fully summarized in a table 
410 in FIG. 4. 
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FIG. 5 is a diagram illustrating a number of different failure conditions 
500 that an inventive storage system can tolerate using another exemplary 
configuration of the present invention. Here, the storage system has five disk 
drives (A through E) where any and all possible combinations of up to three 
5 drives may fail and the data is still recoverable. A failure condition 502 shows 
that disk drives A, B, and C have failed; a failure condition 504 shows that disk 
drives A, B, and D have failed; a failure condition 506 shows that disk drives A, 
B, and E have failed; and so on for a failure condition 508 where disk drives C, 
D, and E have failed. For all of these failure conditions, the data is still 

10 recoverable from the storage system. These failure conditions are more fully 
summarized in a table 510 in FIG. 5. 

As apparent, the redundancy configuration of FIG. 5 provides more data 
redundancy than that of FIG. 4. The tradeoff should be apparent: a storage 
system with more data redundancy (FIG. 5) is less likely to have a 

15 nonrecoverable failure but will have less space to store user data. Users of a 
storage system may want some flexibility to adjust this data redundancy. As 
will be described in more detail below, the inventive storage system may allow 
for adjustment of a disk loss insurance" in response to user input (e.g., 
administrative control data). The failure conditions shown in FIGs. 4 and 5, for 

20 example, could be failure conditions from the same storage system in two 

different programmed configurations. In one embodiment, the variable m can 
be made adjustable from 1 to (n-1). 

Data Storage and Wiencko Codes. Definitions are provided for 
25 discussion. These definitions should not be used in a limiting sense when 
construing terms, but are provided in order to teach those skilled in the art 
how to practice aspects of the invention in connection with the examples 
provided. 
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A "Wiencko system* is a collection of similar stripe sets occupying a set 
of n parallel devices, having the capabilities of data storage, parity formation, 
and reconstruction as described below. A "device" may be a physical disk, a 
partition of a physical disk, a file on a physical disk, or other data storage 
5 hardware or software or parallel data communications hardware c>r software of 
any sort. A "disk" in the context of this description is the same as device. A 
"physical disk" may be a hard drive, floppy drive, or other randomly-accessed 
digital block storage device, or hardware or digital programming of equivalent 
functionality. 

10 A "stripe" is that part of a stripe set that resides on a single disk. Every 

stripe of a stripe set is an integer number of chunks, and starts and ends at 
chunk boundaries. A certain number of chunks of the stripe consist of data, 
and the rest consist of parity. A "stripe set" is a minimal set of chunks, 
distributed over all n disks in a Wiencko system, that form a self-contained 

15 whole from the point of view of data storage, parity formation, and 

reconstruction algorithms. The capabilities of a Wiencko system are fully 
realized over a stripe set. Stripe sets need not influence one another's parity or 
reconstructed data. 

A "block" is the minimum contiguous collection of data that can be read 

20 from or written to a device. In the case of a physical disk, a block is often some 
multiple of 512 bytes (4096 bits); other communication devices may have 
usable block sizes as small as one bit. A "chunk* is the maximum contiguous 
collection of data that is treated as an indivisible unit by the Wiencko code. 
Every bit of a chunk is treated identically, in data and parity operations 

25 relating to bits in other chunks at the same relative bit position. A chunk is an 
integer number of blocks, and starts and ends at block boundaries. A "code 
stripe" refers to those chunks of a stripe that consist of parity. A "data stripe" 
refers to those chunks of a stripe that consist of data. 
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A Wiencko code n.m, where n > m > 0 are integers, may be utilized for 
storing binary data on n similar devices, in such a way that any m of the 
devices can be removed or destroyed and all of the data are still recoverable. 
(Here, "similar" means their raw data storage capability is the same when all 
5 organized by one chunk size, but the devices themselves may be physically 
dissimilar.) 

A Wiencko code may be associated with the following terms and 
relationships. Data is organized in chunks of a certain number of bits each. D 
chunks form a stripe (wholly on one of the n disks) and n stripes, one on each 

10 disk, form a stripe set. D is a certain positive number which is divisible by Dl, 
where Dl is defined to be n/gcd(n,m), and gcd(n,m) is the greatest common 
divisor of n and m. The integer parameter p is defined to be p = D/Dl. Chunk 
size and stripe size are constant throughout the Wiencko system (at least at 
one given time), and each disk of the Wiencko system contains the same 

15 number of stripes, all organized into stripe sets. For Q = (mD)/n (this is an 
integer because Dl divides D), and H = D-Q, every stripe consists of H data 
chunks and Q parity chunks. There is a one-to-one relationship between the 
raw decoded data stored on the Wiencko system and the data found in the data 
chunks of all the stripes of all the stripe sets in the Wiencko system. 

20 It follows that the raw data capacity of the Wiencko system equals the 

theoretically maximum value of n-m, where data capacity is measured in units 
of one raw disk capacity. It also follows that for a system in which timing is 
dominated by data transmission timing to and from disks, and data 
transmission in parallel to or from any set of disks overlaps with no significant 

25 time penalty, the Wiencko system approaches the theoretical maximum raw 

throughput of n for large reads and n-m for "large" writes, where throughput is 
measured in units of one raw disk throughput and "large" means spanning 
many stripe sets. 
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The data in the parity chunks of a stripe set is determined from the data 
in the data chunks of that stripe set, using a bitwise XOR matrix. Here, 
"bitwise" means that the k-th bit of any parity chunk is independent of all but 
the k-th bit of each data chunk in that stripe set (where k is any natural 
5 number less than or equal to the chunk size in bits), and the formula for 
calculating this bit is the same for every k in every stripe set of this Wiencko 
system. "XOR matrix" means that the formula includes defining a given parity 
bit by specifying a subset of its available data bits and XORing all the data in 
those places. Thus, there are n*Q parity formulas, and each parity formula 
10 consists of specifying a subset of n*H data bits, one in the k-th position in each 
chunk. 

In addition to being a bitwise XOR matrix, a Wiencko code should be 
solvable for any combination of m lost disks. A bitwise XOR parity matrix is 
defined to be solvable for a combination of m disks if the m*H data chunks of a 

15 stripe set on those m disks can be uniquely calculated from the (n-m)*Q parity 
chunks and (n-m)*H data chunks of that stripe set on the remaining n-m 
disks. For a given n.m, a Wiencko code solution is not necessarily unique for 
that n.m. For each case, there is a smallest positive integer p for which a 
Wiencko code exists with parameter p as defined above. A smaller value of p 

20 greatly improves the computational efficiency of the algorithm. However, there 
are cases where it has been found hat no Wiencko code exists where p= 1 . 

A subset of the Wiencko codes, which have advantages in calculation, 
proving, key storage, and encoding and decoding efficiency, are rotationally 
symmetric Wiencko codes. These satisfy the following additional condition: if 

25 each disk is given a unique index between 0 and n-1, then the parity formula 
for a given parity (defined by its parity index between 0 and Q- 1 , and its disk 
index) specifies its subset from the data chunks (each defined by its data index 
between 0 and H-l, and its disk index) using only the difference modulo n 
between the parity disk index and any data disk index, not the absolute data 
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disk index. Thus for a given parity index, the parity formulas are the same for 
all the disk indices, except for a rotation modulo n of disk indices. 

Further details regarding Wiencko codes are now described. In the 
following discussion, a "bit" shall mean a zero or one, considered as operating 
5 under the addition and multiplication rules of the field of order 2. The common 
name for such addition is "XOR" and for such multiplication is "AND". A "bit 
vector" shall be a vector whose scalar entries are bits, and a "bit matrix" shall 
be a matrix whose scalar entries are bits. Operations on bit vectors and bit 
matrices are analogous to standard vector and matrix operations, following the 
10 rules of linear algebra over the field of order 2. 

Let H and Q be positive integers, and j and k be positive integers. A "j by 
k array of H by Q matrices" is a set of j*k matrices, each of dimension H by Q, 
arranged in double subscript order with the j subscript running fastest 
(horizontally), where the component matrices themselves have the H subscript 
15 running fastest. 

This may be visualized as a matrix whose entries are themselves 
matrices: 



20 



Moo Mio .... Mjo 

Moi Mn .... Mji 



25 Mok Mik .... Mjk 

where J=j-1, and K=k-1. Each M is itself an H by Q matrix. 

An "equivalent matrix to an array of matrices" (with dimensions as given 
above) is the j*H by k*Q matrix that follows by resolving each matrix entry 
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above into its array of scalar entries. Thus, the x+z*H, y+w*Q entry of the 
equivalent matrix is the x, y entry of Mzw in the above array of matrices, where 
0<=x<H, 0<=y<Q, 0<=z<j, 0<=w<k. 

Let n > m > 0 be integers. A "Wiencko array of matrices of type (n, m) M 
5 is an n by n array of H by Q bit matrices, where H and Q are positive 
integers such that 



n/m = (H+Q)/Q = H/Q+ 1, 

10 and the array satisfies the following property: for every set S of m integers 
between 0 and n- 1 , the subarray , created from the array by keeping those 
array matrices whose row index w is not in S and whose column index z is in S, 
has an invertible equivalent matrix. Here, "invertible" has the usual matrix 
theoretic meaning as applied to matrices over the field of order two. A common 

15 term for this is "parity invertible" and the matrix is a "parity matrix." Note that 
the process described above creates an m by (n-m) array of H by Q matrices, 
and by the relationship above, its equivalent matrix is a square matrix. 

A "Wiencko matrix" is an equivalent matrix to a Wiencko array of 
matrices. A "zero diagonal Wiencko array of matrices" is a Wiencko array of 

20 matrices that satisfies the additional condition: Mzw = 0 for each matrix such 
that z=w (that is, each diagnal entry). A "rotationally symmetric Wiencko array 
of matrices" is a Wiencko array of matrices satisfying the following additional 
condition: Mzw = M U v whenever z-w = u-v modulo n. A "zero diagonal, 
rotationally symmetric Wiencko array standard description" is the subarray of 

25 a zero diagonal, rotationally symmetric Wiencko array of matrices created by 
keeping those entries for which w>0 and z=0. The above conditions make it 
easy to generate the entire Wiencko array from its standard description. Zero 
diagonal, and rotational symmetry, and sparseness or near-fullness, are among 
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the conditions on Wiencko matrices that provide more efficient 
implementations of the algorithm, but are not necessary for functionality. 

The following description relates to an example of a Wiencko array. A 
Wiencko array of type (5,2) with H=3 and Q=2 is shown in FIG. 33. Inspection 
5 shows that it is zero diagonal and rotationally symmetric; therefore it can be 
represented by the standard description 

010 - 

100 
10 100 - 

001 

010 - 

001 

000 - 
15 000 

where the dashes show the bounds of the array entries. It now remains to be 
proven that it satisfies the "invertibility" conditions. Because of the rotational 
symmetry, the ten possible combinations (n!/(m!*(n-m)!) = 5!/2!*(5-2)I = 10) 
20 found in the subset S of two numbers between 0 and 4 reduce to only two 
essentially different ones: S = (0, 1) and S = (0, 2). 

The S = (0,1) case reduces to those submatrices in the intersection of 
columns "a" and "b" and the rows V, "d", and "e": 

25 100010 100010 

001100 011101 

010100 Invertible => 000101 

001001 010101 

000010 000010 
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000001 



000001 



The S = (0, 2) case reduces to those submatrices in the intersection of 
columns "a" and a c" and rows a b", u d", and V: 



10 



010000 
100000 
010010 
001100 
000100 
000001 



Invertible => 



010000 
100000 
000110 
000010 
101000 
000001 



Since both of these are invertible, so are the other eight (due to the rotational 
symmetry). Therefore, the bit matrix in FIG. 33 is a Wiencko array. 

15 Referring now to FIG. 32, a flowchart is shown which describes a method 

for determining and/or validating that a given bit matrix is a Wiencko bit 
matrix. This method may be embodied in the form of software executable by a 
processor. Beginning at a start block 3200, data that is representative of 
candidate bit matrix is received (step 3202). This data represents one possible 

20 Wiencko solution for a given m and n. More particularly, the candidate data is 
representative of an n*H by n*Q candidate bit matrix, which is representible by 
an n by n array of H by Q bit submatrices, where n/m = (H/Q) + 1. In this 
method, it may be tested that the n/m = (H/Q) + 1 condition is satisfied. 

Referring to FIG. 33, an example of such a candidate bit matrix 3300 is 

25 shown for an m=2, n=5, H=3, and Q=2 case. As illustrated, candidate bit 

matrix 3300 is representible by a 5-by-5 array of 3-by-2 bit submatrices 3302, 
such as a submatrix 3304. 

Back to FIG. 32, an H*m by Q*(n-m) composite submatrix is selected 
from the candidate bit matrix (step 3204). The composite submatrix is formed 
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by submatrices from the intersection of a unique selection of m column(s) of 
the array and a unique selection of (n-m) row(s) of the array that correspond to 
those column(s) not included in the unique selection of m column(s). There are 
n!/(m!*(n-m)!) such composite submatrices that may be formed from the 
5 candidate bit matrix. Referring back to FIG. 33, an example of such a 

composite submatrix 3206 is shown for the m=2, n=5, H=3, and Q=2 case. As 
illustrated, composite submatrix 3206 is formed by submatrices 3302 taken 
from the intersection of a unique selection of m (two) columns of the array 
(columns B and D) and a unique selection of n-m (three) rows of the array that 

10 correspond to those n-m (three) columns not included in the unique selection 
of m (two) columns (columns A, C, and E). FIGs. 34, 35, and 36 show further 
examples of forming such subarrays and composite submatrices. More 
particularly, FIG. 34 shows all such composite submatrices for m=2 and n=5; 
FIG. 35 shows all such composite submatrices for m=3 and n=5; and FIG. 36 

15 shows all such composite submatrices formed for m=4 and n=5. 

Back to the flowchart of FIG. 32, the composite submatrix is tested for 
invertibility (step 3206). If the composite submatrix is invertible at step 3206, 
it is determined whether all n!/(m!*(n-m)!) composite submatrices associated 
with the candidate bit matrix have been tested (step 3210). If all such 

20 composite submatrices have been tested and are invertible, the overall test is 
satisfied and the candidate bit matrix is a valid solution (step 3212). If the 
composite submatrix is not invertible at step 3206, the test fails and the 
candidate bit matrix is not a solution (step 3208). If any other candidate bit 
matrices are to be tested (step 3214), the flowchart repeats at step 3202 (the 

25 test may have failed at step 3208, or multiple solutions may be desired after 

completion of step 32 12). If no other candidate bit matrices are to be tested at 
step 3214, the flowchart ends at a finish block 3216. 

The above relationships describe Wiencko codes that are "ideaT and 
preferred codes. It is understood, however, that variations on Wiencko codes 
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may be utilized. For example, codes may be provided such that for every value 
of 11, 0< ]i <« m, a subset of all possible combinations of \i unavailable disks is 
recoverable. Any available codes that provide recovery for a limited subset of 
all combinations of m failures (perhaps a majority) may be utilized. 
5 Reference is now made to FIG. 8. Some of the relationships and 

examples described above may be repeated in the following description and 
examples for further clarity. The data and code layout for a disk array with a 
"5/2" configuration is shown in FIG. 8. Assume a chunk size is one bit. To 
begin the procedure, a set of operations must be performed on stripes of data. 

10 As described above, the number of bits in minimum stripe on a disk depends 
on a number of factors, including n, m, and a matrix expansion coefficient p 
that (in many cases) is one. Another contributor is the greatest common factor 
of the two numbers n and m, which is represented by GCF{n, m). The number 
of bits in a minimum data stripe is H = p*(n-m)/GC^n, m) per disk. The 

15 number of bits in a minimum code stripe is Q=p*m/ GCF[n, m) per disk. For a 
"4/2" configuration, with four (4) total disks coded to allow two to be missing, a 
solution exists in which p=l, and GCF\4, 2)=2, so H=l and Q=l. For a "5/2", 
n=5, m=2, p=l, GCF{5, 2)=1, so H=3 and Q=2. For a "5/3" configuration, n=5, 
m=3, p=l, GCFXS, 3)=1, so H=2 and Q=3. 

20 The user data bits may be stored in "clear* form, so when writing data, 

the only additional task is to calculate and store the code bits on each disk. 
Each bit of the code stripe of a given disk is an XOR of some of the data bits of 
the other disks. An encoding mask r can be defined, composed of mask 
elements rnm in which a mask element value of "I" means inclusion in the XOR 

25 calculation, and a value of "0" means exclusion from the XOR calculation. 
Suppose n=4 and m=2. The encoding matrix for data bits A, B, C, and D to 
yield code bits A', B', C\ and D' can be represented as follows, with "(+)" 
denoting "exclusive OR": 
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A'=r ul A{+)r X2 *B(+)r x ,C(+)r XA D 
= r w ^(+)r M fi(+)r M C(+)r 2 4 Z> 
C= r 3 1 ^(+)r 3t2 ^(+)r3 ( 3C(+)r 3(4 Z) 
Z>'= r 4 ^(+)r 4 i2 B(+)r 4tJ C(+)r 4i4 D 

5 

Alternatively, this equation set may be represented simply be referring to the 
mask, r = 

{ri.i ri,2 n,3 n,4} 

{r2,l T2,2 T2 P 3 T2,4} 

10 {r3,l T3 f 2 T3,3 T3.4} 

{r4,l T4 f 2 T4,3 T4,4} 

The mask should be chosen such that a valid decoding solution exists for each 
combination of channels that can be missing, rxx can be chosen to be 0 in all 
15 cases with no loss of information. 

Example - "4/2" Configuration. Suppose n=4 and m=2. Let A, B, C, and 
D denote data bits for each of four disks in an array, and A', B', C and D' 
denote coding bits. The encoding functions may be defined as follows: 



20 



25 



A'=C (+) D {0011} 

B'=A (+) D equivalent tor= {10 0 1} 

C'=A (+) B {110 0} 

D'=B (+) C {0110} 

Decoding Functions: 

Disks A & B Missing: A=C(+)C , (+)D > B=C(+)D' 

Disks A & C Missing: A=D(+)B OB(+)D' 
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Disks A & D Missing: A=B(+)C D=B(+)B'(+)D' 

Disks B & C Missing: C=D(+)A' B=D(+)A'(+)D' 

Disks B & D Missing: B=A(+)C D=C(+)A' 

Disks C & D Missing: OA(+)A'(+)B' D=A(+)B' 

5 

Just as with the encoding functions, the decoding functions can be 
specified by use of a binary mask. Note that this encoding matrix mask is a 
circularly- symmetric matrix. When the mask is chosen with this property, a 
number of advantages are realized. A circularly-symmetric matrix can be fully 
10 described by a small subset of the complete mask. If we know the matrix is 
circularly symmetric, the complete encoding mask above can be fully specified 
using: 

{1} 

15 {1} 

{0} 

Note that this submatrix is the entire first column, except for the first element. 
The first element of the first column will always be zero, because it represents 

20 the trivial relation A'=A; it is never necessary to encode a disk's own data bits 
into a disk's coding bits. 

Most cases of disk failures can be represented as rotations of a smaller 
number of unique failure cases. Thus, for A 85 B missing, B 85 C missing, C & 
D missing, and D & A missing, all of these can be considered rotations of the 

25 case in which A & B are missing. Similarly, A & C missing and B 86 D missing 
can be considered rotations of the case in which A 85 C are missing. When 
decoding, the six cases of two disks missing can be represented as rotations of 
two unique cases. If the decode logic has this rotational ability built in, it is 
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necessary to consider and store only the decode masks of the two rotationally 
unique cases. 



Example - "5/2" Configuration. See FIG. 8, where n =5 disks total, m=2 
5 disks missing, p=l, resulting in d=3 data bits per disk and c=2 coding bits per 
disk (15 data bits to the array). As shown, the disks are designated A through 
E. FIG. 9 is an encode function matrix for redundancy data generation. Three 
data bits of disk A are designed Al, A2, and A3. The two coding bits are 
designated as A' and A". Data and coding from the other channels B, C, D, and 

10 E are represented similarly. The binary matrices are shown with asterisks for 
ones and spaces for zeroes. The submatrix corresponding to the case in which 
disks A&B are missing are circled in the 1 st table of FIG. 10. This submatrix 
appears below in the decoding solution for disks A&B missing. 

The coding mask selection is done during the design process, and does 

15 not need to be repeated during operation: The encoding function can be 

hardcoded into an arrangement of logic gates, or stored as a binary array mask 
that is retrieved to control the function of a logic array or in low-level software 
in a disk array system. As the number of disks in an array increases beyond 
four or five, it quickly becomes far more advantageous to use the binary array 

20 mask storage method. 

(1) Decoding Function for A&B Missing. The following discussion makes 
reference to FIG. 10 of the drawings. For a particular set of m devices missing, 
a decoding function exists such that the missing data is equal to a set of XOR 
operations on the surviving code and data. The decoding function is found 

25 from the solution of a set of simultaneous equations in modulo-2 number 
space using XOR operations. 

Mathematically, the decoding function is formed by taking a submatrix of 
the encoding function mask matrix, changing format to add columns for the 
coding bits, and then taking the modulo-2 matrix inverse of the subset of the 
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encoding function mask matrix. Identical operations are performed on the 
subset of the encoding mask matrix and the additional coding bit columns. m 
A solution set is shown in FIG. 10. The modulo-2 matrix inversions for each 
unique non-rotational case of channels missing are performed during the 
5 design process in order to confirm proper mask selection. The results of these 
inversions constitute the decode functions, and can be stored as binary array 
masks, just like with the encoding function. Alternatively, since the decoding 
masks are easy to calculate from the encoding mask, it is an implementation 
option for the disk array system to calculate the decoding masks when needed 
10 (e.g., in software) from the stored encoding masks. 

(2) For the decoding function for A&C missing, see FIG. 11. For 
convenience, the same encoding matrix is also reprinted. However, a different 
submatrix is highlighted (circled), to correspond with the case in which disks 
A&C are missing. A solution set is shown in FIG. 1 1 . For the decoding 

15 function for A&D missing, see FIG. 12. For convenience, the same encoding 
matrix is also reprinted. However, a different submatrix is highlighted, to 
correspond with the case in which disks A&D are missing. A solution set is 
shown in FIG. 12. 

(3) A&D Missing is a Rotation of A&C Missing. See FIG. 13 of the 

20 drawings. Since the encoding matrix was. designed to be circularly symmetric, 
the decoding function matrix mask for disk A&D missing is identical to a 
circular rotation of the case in which disks A&C are missing. This 
equivalence is demonstrated by starting to solve for disks A&D missing, and 
then circularly rotating this solution matrix to show that it is the same solution 

25 matrix as the case in which disks A&D are missing. To utilize the "A & C 

disks missing" solution for this case, recognize that disks A&D being missing 
is just a circular shift of variables by three positions from the case in which 
disks A&C are missing. Begin with the solution for disks A&C missing, and 
circularly rotate the variables three positions forward as follows: 
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A D, B => E, C => A, D => B, E => C. 

Note that the cases of missing disks (AB, BC, CD, DE, EA) are all circular 
5 rotations of each other, and the cases of missing disks (AC, BD, CE, DA, EB) 
are all circular rotations of each other. Thus, there aire ten total solution cases, 
but because of circular symmetry only two of them are computationally unique. 

The encoding method is simple enough to allow, if desired, coding of 
different sets of data differently on the same array, depending on the 
10 importance of each data set. For example, data with low importance can be 
coded to allow one disk of an array to fail, data with moderate importance can 
be coded to allow two disks to fail, and data of high importance can be coded to 
allow three disks to fail. Alternatively, an entire disk array can have all of the 
data coded the same way. 

15 

Embodying and Applying Wiencko Codes /Functions. Referring now to 
FIGs. 6 and 7, Wiencko codes/ functions (as well as all related methods and 
apparatus described herein) can be embodied in a number of different ways. 
For example, the codes/ functions described can be implemented in a hardware 

20 device 600, such as a programmable gate array (PGA), shown in FIG. 6. In this 
embodiment, simple exclusive-OR gates, such as an exclusive-OR gate 602, are 
easily combined in accordance with the desired functions. The 
codes/ functions can also be embodied and implemented using software as 
indicated in FIG. 7. Such software is embedded or stored on a disk 702 or 

25 memory 706, and executable on a computer 704 or a processor 708. For 
providing redundancy adjustability (described in more detail later below), 
simple switches (hardware) or mask set selection (software) may be used. Here, 
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the appropriate XOR logic circuits or mask sets are selected based on the 
control data. 



Related Methods And Apparatus. FIGs. 16.2, 17.1, 17.2, 18.1, and 18.2 
5 are flowcharts describing general methods which may be used in connection 
with the various embodiments. More particularly, FIG. 17.1 is a flowchart 
describing a method of generating redundancy data. Beginning at a start block 
1700, user data is received for storage on a disk array (step 1702). 
Redundancy data is generated based on the user data and the set of data 

10 redundancy functions (step 1704). (If multiple sets of data redundancy 
functions are made available in the system, the set of data redundancy 
functions are those selected as described below in relation to FIG. 16.2.) The 
user data and the generated redundancy data is stored on the disk array (step 
1706). The flowchart ends at a finish block 1708, but may repeat for other 

15 storage requests. 

Step 1704 of FIG. 17.1 may be accomplished in connection with the 
method described in relation to the flowchart of FIG. 17.2. Beginning at a start 
block 1710, the data is multiplied modulo 2 by a Wiencko bit matrix (step 
1712). More specifically, the stripe set data (n*H chunks) is expressed as a 

20 column vector to the right of the Wiencko matrix, and multiplied modulo 2 by 
the Wiencko matrix to get the stripe set parity (n*Q chunks). As apparent, this 
algorithm is very efficient (especially when the Wiencko matrix is sparse or 
nearly full). The flowchart of FIG. 17.2 ends at a finish block 1714. 

FIG. 18. 1 is a flowchart describing a method of recovering user data. 

25 Beginning at a start block 1800, a request for user data is received (step 1802). 
If all of the disks and data are available (step 1804), the user data is read from 
the disk array (step 1806). In this situation, data manipulation need not take 
place since the user data may be stored as "clear* data on the disks. The user 
data is then provided to the requester (step 1814). The flowchart ends at a 
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finish block 1808, but may be repeated for each request. On the other hand, if 
some of the data is determined to be unavailable at step 1804, the appropriate 
data recovery functions associated with the available disks or data are selected 
(step 1810). The user data is then recovered using the selected data recovery 
5 functions, available user data and redundancy data (step 1812). The user data 
is provided to the user (step 1814), and the flowchart ends at finish block 
1808. 

Step 1810 of FIG. 18.1 may be accomplished using the method described 
in the flowchart of FIG. 18.2. Beginning at a start block 1816, the stripe set 

10 data found in the n-m remaining disks is arranged as in the encoding 

algorithm, with zeroes in place of the data found in the missing disks (step 
1818). This arranged data is multiplied by that subarray of the Wiencko 
matrix that corresponds to array row numbers not found in S (rows 
corresponding to available disks) (step 1820). This step gives all the parity 

15 chunks for the stripes of the disks not found in S without the influence of the 
missing data. These parity chunks created are XOR'd with the parity actually 
found on the disks not found in S (step 1822); which by the linearity of matrix 
multiplication gives the non-S parity chunks that would have been produced 
by applying the Wiencko matrix to a data stripe set that included the missing 

20 data on the S disks but was zero on all the non-S disks. The appropriate 
inverse matrix that exists for S is applied to this parity (step 1824), which 
yields the missing data. If reconstruction is to be carried out, parity for the 
disks in S can be created by matrix multiplication by the subarray of the 
Wiencko matrix that corresponds to row numbers in S. 

25 The mathematical concepts involved in encoding and decoding are now 

described further by example. The simple example below involves a Wiencko 
array of type (5,2) with H=3 and Q=2: 

0 0 0|0 0 0|0 1 0|1 0 0|0 1 0 

30 I I I I 
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10 



15 



0 0 0 



0 10 
10 0 



10 0 
0 0 1 



0 10 
0 0 1 



0 0 0 
0 0 0 



0 0 0 



0 0 0 
0 0 0 



0 10 
10 0 



10 0 
0 0 1 



0 10 
0 0 1 



0 0 1 



0 0 0 
0 0 0 



0 0 0 
0 0 0 



0 10 
10 0 



10 0 
0 0 1 



0 0 1 



0 10 
0 0 1 



0 0 0 
0 0 0 



0 0 0 
0 0 0 



0 10 
10 0 



10 0 



10 0 
0 0 1 



w 



0 10 
0 0 1 



0 0 0 
0 0 0 



0 0 0 
0 0 0 



20 The missing disks, for simplicity, will be 0 and 1. The S={0,1} case reduces to 



dead data # live data 



25 



30 



35 



40 



45 



dead 
parity 

######## 

live 
parity 



0 0 0 
0 0 0 



0 10 
0 0 1 



0 0 0 
0 0 0 



0 0 0#0 1 0 

# 

0 0 0#0 0 1 



0 1 0|0 0 0#0 0 0|0 1 0|1 0 0 

# 

1 0 0|0 0 0#0 0 0|0 0 1|0 0 1 

############################# 

1 0 0|0 1 0#0 0 0|0 0 0|0 1 0 

# 

0 0 111 0 0#0 0 0|0 0 0|0 0 1 



-#- 



1 0 0#0 1 0 

# 

0 0 1#1 0 0 



0 1 0#1 0 0 

# 

0 0 1#0 0 1 



10 0 
0 0 1 



0 0 0 
0 0 0 



0 10 
10 0 



0 10 
10 0 



0 0 0 
0 0 0 



0 0 0 
0 0 0 



W ' • # xs 
######### = w 
ws # w 



where W M (dead to dead), XS (live to dead), WS (dead to live) and W'(live to live) 
are subarrays of the Wiencko array. (The same approach will work for deads 
that are not together, but is more complicated to visualize.) 
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If D is a data vector, then we similarly can write 



d» 1 

5 D = ### 

D' 

where D" is the part on the dead channels and D' on the live channels. 
Similarly for a parity vector 

10 

p.' • 

p = ### 
P 1 

15 Then by matrix multiplication the encoding procedure gives 

(1) P = WD 
or 

20 

P" W» • # XS D 1 1 

(2) ### = ########## ### 
P' WS # W« D' 

25 or 

(3a) P' 1 = (W 1 1 D ' 1 ) + (XS D • ) 
(3b) P' = (WS D' • ) + (W 1 D' ) 

30 

In the case we are working with (bit matrices), adding and subtracting are the 
same as "XOR" or " A ", and multiplication is the same as "AND" or "85". 

NOW SUPPOSE D' AND F ARE KNOWN BUT D" AND P" ARE LOST, but we are 
35 assured that (1) holds true for the known matrix W. We now define 

PI = W« D 1 

and 

40 

P3 = P 1 

so PI can be calculated and P3 read up from live data. Now set 

45 P2 = P3 - PI = P3"P1 = P3 + PI 

We get from (3b) that 
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(4) WS D" = P2 

and because of the definition of a Wiencko Matrix, there exists a matrix 

5 

-l 

JS = WS 



whence it follows that 

10 

(5) JS P2 = JS (WS D") = (JS WS) D" = D ,( 

which recovers the lost data. Another expression for (5) is 
15 P' 

(6) D 1 * = JS (I # W» ) ## 

D' 



where I is a live parity-sized identity matrix. This holds because 

20 

p f 

(I # W 1 ) ## = (I P 1 ) + <W« D') = P' + P3 = PI + P3 = P1~P3 
D 1 

25 We define the preconditioning matrix 
PC = (i # w ) 



We can then do (6) in two different ways: 

30 

p 1 

(6a) D* 1 = (JS PC) ## (single step) 

D 1 

35 P' 

(6b) D 1 ' b JS (PC ##) (two step) 

D' 

The single step method, which precsdculates (JS PC), is better for 
40 small n, the two step method is better for large n and sparse Wiencko 
matrices. 



Example: 

Matrix multiplication modulo 2 is used. # is not a number, but a 
45 separator between dead and live territory. 
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W D P 

5 o 

1 

000000#010100010 1 1 

000000#001001100 0 0 

010000#000010100 1 1 

10 100000#000001001 1 1 p 1 • 

################ # # ### 

100010#000000010 0 1 P3 

001100#000000001 1 0 

010100#010000000 1 0 

15 001001#100000000 0 0 

000010#100010000 1 0 

000001#001100000 0 0 

1 

0 

20 i 

D" and P" are lost. D' and P3=F are known, as is W. 

W D 1 PI 

25 

000000010 0 0 

000000001 1 1 

010000000 1 1 

100000000 0=0 

30 ioooioooo i i 

001100000 0 1 

1 

0 

1 



35 



45 



PI P3 P2 



0 1 1 

40 ioi 

1 0 = 1 
0 0 0 
10 1 
10 1 



WS JS 



100010 100010 

50 oonoo oinoi 

010100 inverse = 000101 

001001 010101 
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000010 
000001 



000010 
000001 



5 



JS 



P2 



10 



100010 
011101 
000101 
010101 
000010 
000001 



1 
1 
1 

0 

1 
1 



0 

1 
1 

0 

1 
1 



15 And thus, D n is recovered. P M can be recovered by encoding from D M 
and D'. 

Detection of Burst Errors With Use of Wiencko Codes. The following 
discussion relates to an optional technique for error detection with use of 
20 Wiencko coding. This technique is especially applicable to burst errors. In 
some applications, it may be used as the primary error detection technique 
without use of any other error detection/ correction codes. The methods now 
described may be similarly embodied in hardware or software as described 
herein. 

25 The corresponding concepts for a Wiencko array — a row or column 

whose constituent entries are H by Q matrices — are named a "channel row 
array" or "channel column array" to distinguish from these. In cases where the 
context is clear, either a channel row array or a channel column array may be 
called a "channel". A channel, in either case, is numbered from Oton-1. The 

30 "intersection of a row with a channel" is the intersection of a row with (the 

equivalent matrix of) a channel column array. It is always wholly contained in 
one entry matrix of the Wiencko array, and always is a row of H bits. The 
"intersection of a column with a channel" is the intersection of a column with 
(the equivalent matrix of) a channel row array. It is always wholly contained in 

35 one entry matrix of the Wiencko array, and always is a column of Q bits. 
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Either of these is called "zero" if all its bits are zero, otherwise it is called 
"nonzero". A set of bits is called "related" if each bit comes from a different 
chunk of the same stripe set, and each bit is at the same bit offset from the 
start of its chunk. The maximum number of bits that can be related is 
5 n*(H+Q). A set of related bits is called a "related set", and a maximal set of 
related bits is a "full related set". 

The error detection technique is as follows. At all times, both data and 
parity are received from all n channels. To detect faulty channels, the Wiencko 
encoding procedure is applied to the data and the results compared with the 

10 parity over a stripe set or a number of stripe sets, or over a portion of a stripe 
set whose bits have the same relative offsets in their respective chunks. All 
parity chunks which exhibit a discrepancy are noted. 

Definition (error bit and delta bit): The "error bit" at position k in a parity 
chunk is the XOR of the value read at k of that parity chunk and the value 

15 calculated using the Wiencko encoding procedure from the values at k of the 
corresponding data chunks. The a delta bit" of a certain bit location is the XOR 
of the correct value of the bit and the value read on a faulty channel. 

Definition (zero favor): Let R be a random variable that takes the values 
0 and 1, 0 with probability p and 1 with probability q = 1-p. The "zero favor" of 

20 R is defined as 

(1) z(R) = p - q 

Lemma (zero favor of XOR): Let Rl and R2 be two independent random 
variables taking the values of 0 and l. The zero favor of the XOR of Rl and R2 
is the product of the zero favors of Rl and R2. 
25 Proof: Let R be the XOR of Rl and R2. Using the obvious notation it 

follows from independence that 

(2) p = plp2 + qlq2 

(3) q = plq2 + qlp2 

so it follows that 
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(4) z(R) - pl(p2-q2) + ql(q2-p2) « (pl-ql)(p2-q2) = z(Rl)z(R2) 
which completes the proof. 

Theorem (Error Detection): (A) Suppose T is a set of faulty channels, so 
that T - all channels not in T = a set of error-free channels. Any parity chunk 
5 on a channel of T whose row in the Wiencko matrix has a zero intersection 

with all channels of T will show no discrepancy when compared with the parity 
computed from the data in its stripe set. (B) Let the assumptions be as in (A) 
and, further, suppose that for a set of k full related sets, all delta data and 
parity bits from channels of T are independently random (a burst error) such 
10 that each bit has a zero favor of absolute value less than or equal to u< 1. 

Then the probability of a nonzero error bit at any of the parity locations 
in any of the k related sets is always greater than or equal to 

(1 -u r+ *)/2 

15 where r is the number of ones in the intersection of its row of the Wiencko 
matrix with all the channels of T, and s is 1 if the parity is in T and 0 if the 
parity is not in T. 

Proof: (A) is a trivial consequence of the encoding scheme as described 
earlier above. (B) follows because the error bit is the XOR of r+s bits, each with 
20 a zero favor between u and -u. (Data or parity bits that are not in T do not 

contribute to the error bit, nor do data bits that meet a 0 in this parity bits row 
of the Wiencko matrix.) Applying the Lemma to the XOR of these r+s random 
variables gives 

_ u r+s <= z (error bit) <= u r+s 

25 

from which the claim follows. This completes the proof. 

As a consequence of the theorem, it is extremely probable for a large 
burst size that all the parity chunks not included in (A) of the theorem will, in 
fact, exhibit errors. Thus, examining which parity chunks exhibit errors, in 
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conjunction with the pattern of zeroes in the Wiencko matrix, yields high 
probability information on which channels are good and which exhibit errors. 

The error detection technique depends on the Wiencko matrix. For each 
set T of faulty channels there is a unique maximal set R(T) of parity chunks 
5 that satisfy the hypotheses of (A) of the theorem. R(T) may be empty. Clearly if 
T is a subset of U then R(U) is a subset of R(T). Also if T is empty then R(T) is 
all the parity chunks, where if T is nonempty then R(T) is not all the parity 
chunks (it must exclude, for instance, the parities in channels of T). A parity 
row "hits" a channel if either that parity chunk is a part of that channel, or that 

10 row in the Wiencko matrix has a nonzero intersection with that channel. R(T) is 
the set of parities that do not hit any member of T. A Wiencko matrix is 
"favorable to mc", for 0<=mc<= : m, if every T of mc faulty channels has the 
property that R(T) is strictly greater than R(Tp) for any Tp that strictly includes 
T. That is, for each channel not in T, at least one parity in R(T) hits that 

15 channel. If there are no parity errors in R(T) during a long burst, it can be 
concluded that the bad channels are a subset of T. The reason is as follows: 
Suppose the one of the channels not in T is bad. That channel hits at least one 
of the parities in R(T) and therefore, by (B), the chances of an error on that 
parity over the long burst are very high. This error however is a parity error in 

20 R(T). 

A simple example below is a Wiencko array of type (5, 2) with H=3 and 
Q=2, which when transposed becomes a Wiencko array of type (5, 3) with 
H=2 and Q=3: 

25 0 1 2 3 4 - hits (5, 2) 

0 0 0|0 0 0|0 1 0|l 0 0|0 10 x xxx 

I I I I 

0 o o|o 0 0|0 0 l|0 0 l|l 0 0 x xxx 

30 I I I I 

0 i ojo 0 o|o 0 0|0 1 0|1 0 0 x x x x 

I I I I 

1 0 0|0 0 0|0 0 0|0 0 1|0 0 1 xx xx 
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20 s r s 

In the (5, 2) case, this technique can distinguish any one-channel loss from 
any other. The (5,3) case is more interesting. The technique can distinguish 
any two-channel loss from any other in this case. For instance, marked with 
25 an r is the set R({0,1}) and with an s the set R({0,2}). 

Detailed Practical Application of Codes and Algorithms for Data Storage. 
Referring now to FIG. 14, a block diagram of a system 1400 embodying 
inventive aspects is shown. System 1400 comprises a network and a network- 

30 attached server (NAS) 1402, where the network is represented in part by a hub 
or switch 1404 connected to a plurality of hosts 1406. NAS 1402 includes a 
host 1408 connected to a plurality of disk drives 1412 through a bus interface 
1410. Disk drives 1412 are "packaged" in a number of preferably six, twelve, 
or eighteen. Host 1408 may also be connected to a control terminal 1424 

35 through a port 1426. Bus interface 1410 may utilize any suitable bus 

technology, for example, Peripheral Component Interconnect (PCI), Small 
Computer System Interface (SCSI), Fibre Channel, or Integrated Drive 
Electronics (IDE). In addition, control terminal 1424 may be any suitable 
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control device, such as a VT-100 terminal, and port may be a COM1 port. 
Alternatively, control terminal 1424 could be used with a Telnet connection. 

Host 1408 utilizes an operating system 1412, a device driver 1414, a 
device driver 1416, and a serial I/O driver 1418. Host 1408 also includes a 
5 redundancy data generator/ data recovery component 1422, which 

communicates with device driver 1416 in accordance with an Application 
Program Interface (API) 1420. Component 1422 also communicates with bus 
interface 1410 for use in connection with disk drives 1412. Operating system 
1412 may be any suitable operating system, such as Disk Operating System 

10 (DOS), NT, Unix, Linux, etc. Coupled to the network, device driver 1414 may 
be implemented with one of a number of different technologies, such as PCI, 
10/ 100BaseT (high-speed Ethernet), Fiber Distributed Data Interface (FDDI), 
or Copper Distributed Data Interface (CDDI). 

Component 1422 is operative with disk drives 1412 as described above. 

15 Component 1422 embodies inventive principles described above (and below), 
and provides system 1400 with the advantages. On one hand, component 

1422 may provide a fixed amount of data redundancy in the system. 

♦ 

Alternatively, component 1422 may provide for a selection or adjustment of 
data redundancy. Control terminal 1424 is an administrative terminal, utilized 

20 by one who has appropriate rights to control the particulars of system 1400. If 
component 1422 allows for adjustments to data redundancy, control terminal 
1424 provides the interface to input such control data. The amount of 
redundancy to be provided may be in the form of the maximum number of 
drives that can be lost simultaneously, or some other value or indication. A 

25 user of one of hosts 1406 may alternatively provide control data to adjust the 
amount of data redundancy, given the appropriate system rights and 
authentication . 

FIG. 15 is a schematic block diagram of a controller 1500 for a disk 
array. Controller 1500 includes a processor 1502, a non-volatile memory 
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1504, a host interface 1506, a cache 1508, and a plurality of I/O processors 
1510. Non-volatile memory 1504, which may be (e.g.) a read-only memory 
(ROM) or electrically erasable/ programmable ROM (EEPROM), has software 
stored therein for execution by processor 1502. The plurality of I/O processors 
5 1510 include I/O processors 1512, 1514, and 1516, which may be (e.g.) SCSI 
I/O processors (SIOPs). Controller 1500 also includes circuitry which provides 
for redundancy data generation and data recovery as described above, and is 
shown as a PGA 1520. All of these components are coupled to a bus 1518. 

A typical command flow for a "write" operation for controller 1500 is now 

10 described. A host issues a write command to controller 1500 to write data to a 
logical drive. Processor 1502 initiates a command to PGA 1520 to setup the 
appropriate instruction type. Processor 1502 sets up the host interface 1506 
to transfer data to cache 1508. Upon completion of the transfer, host interface 
1506 notifies processor 1502. Processor 1502 determines that some data may 

15 be required from SCSI disks for parity computation, and instructs the I/O 

processors 1510 to retrieve this data. I/O processors 1510 transfer data from 
the disks through PGA 1520 to cache 1508 and notify processor 1502 upon 
completion. Processor 1520 sends this data through PGA 1520 to thereby 
compute the appropriate redundancy data. Processor 1502 instructs I/O 

20 processors 1510 to transfer the data, as well as the redundancy data, to the 
disk drives. I/O processors 1510 transfer data from cache 1508 to the disks 
and notify processor 1502 upon completion. Processor 1502 in turn notifies 
the host. 

A typical command flow for a "read" operation for controller 1500 is now 
25 described. A host issues a read command to controller 1500 to read data from 
a logical drive. Processor 1502 initiates a command to PGA 1520 to setup the 
appropriate instruction type. Processor 1502 determines that the data resides 
on multiple disks and instructs I/O processors 1510 to retrieve data from the 
disk drives to cache 1508. I/O processors 1510 transfer data from the disks 
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through PGA 1520 to cache 1508 and notify processor 1502 upon completion. 
The data is transferred through PGA 1520 to either provide a "pass- through," 
or data recovery as described above if some disk or data is unavailable. 
Processor 1502 sets up host interface 1506 to transfer data from cache 1508 to 
5 system memory. Upon completion, host interface 1506 notifies processor 
1502, which in turn, notifies the host. 

Preferably, the data redundancy generation and the data recovery is 
performed using a special processor technology which allows operations to be 
performed simultaneously on multiple units of data. This technology is 
10 available today in the form of Multimedia Extensions (MMX) by Intel 
Corporation. 

" Disk Loss Insurance" Adjustability. Another aspect of the invention 
gives users a choice of how much redundancy to have as part of a disk array. 

15 The selected redundancy is made immediately and continuously useful 

throughout the operational life of the disk array. It is a natural extension of 
traditional RAID technology, one that offers performance advantages no matter 
what degree of redundancy in the disk array is desired. 

Referring now to FIG. 16.2, a flowchart describing a method for use in 

20 redundancy adjustability is shown. Beginning at a start block 1600, control 
data that is indicative of an amount of data redundancy desired in the disk 
array is received (step 1602). The control data is indicative of the value "m" 
and may take a variety of forms; for example, it may be the value m or m/n. 
Based on this control data, one of multiple sets of data redundancy functions 

25 (or codes) are selected (step 1604). One of multiple sets of data recovery 
functions (codes) is selected based on the control data (step 1606). The 
flowchart ends at a finish block 1608. This method may be performed once to 
configure the system, or may be executed each time data is stored on the disk 
array (FIG. 17) and each time data is read from the disk array (FIG. 18). 
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Referring now to FIG. 16.1, a schematic block diagram of one example of 
an adjustable redundancy and recovery apparatus 1610 is shown. In this 
example, apparatus 1610 is operative in connection with five disks (n=5) and m 
is adjustable. Apparatus 1610 includes a plurality of function set components 
5 1612 coupled to selectors 1622 and 1624. In this example, there are four 

function set components, namely function set components 1614, 1616, 1618, 
and 1620 associated with m=l, m=2, m=3, and m=4 respectively. Selectors 
1622 and 1624 are operative to select one of function set components 1612 for 
redundancy data generation and data recovery. Redundancy control signals 
10 are fed to selectors 1622 and 1624 to establish which one of function set 

components 1612 are selected. Apparatus 1610 may be implemented in many 
suitable ways, such as with software or hardware as described in relation to 
FIGs. 6 and 7. 



15 Conventional RAID Comparisons. RAID-1, "disk mirroring," allows for 

two disks worth of information to be copied to two other disks. RAID-1 fails, 
though, if the "wrong" two of the four disks fail. A "4/2" configuration allows 
any two of the four disks to fail while still allowing the original two disks of 
information to be fully recovered. Because RAID-1 solutions are costly, many 

20 databases rely strictly upon RAID-5 with striping and parity for protection 

against drive failure. However, RAID-5 supports continued operation only in 
the event of a single inoperable drive at any one moment. Losing two or more 
drives under RAID-5 brings operations quickly to a halt. For the cost of adding 
just one more drive, storage system 100 mitigates the risk of data loss by 

25 providing the means to sustain up two drive failures. 

One of the better conventional protections for critical information today 
is accomplished through RAID-1 (mirroring), overlaying RAID-5 (striping with 
parity) and then adding a global hot spare. For example, if data consumes four 
disk drives, then reliability can be improved by replicating this data on a 
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second "stack" of four drives. Within each stack, however, losing just one drive 
would make the whole database useless. To enhance reliability, each mirrored 
stack can be configured as an individual RAID-5 system. Since using parity 
adds the need for an additional drive, user data and parity information are how 
5 striped across five drives within each stack. This provides protection against 
the loss of a single drive within each stack. So, from an original database that 
required just four drives, this RAID configuration has grown to include: four 
drives for the original data; four drives for the mirrored data; one parity-drive 
(equivalent) for each stack (two in total); and one global hot spare (standby 

10 drive on which data cam be rebuilt if a drive fails). 

Referencing back to FIG. 2, a conventional RAID architecture shown 
requires a total of eleven disk drives: Here, seven drives have been added to 
protect data on the four original drives. The configuration can recover from a 
failed drive in either stack. Even if all the drives in one stack failed, the 

15 remaining drives in the surviving stack would still provide access to critical 

data. However, in this case, only one drive failure in the remaining stack could 
be tolerated. If multiple drive failures occur within each stack, the data cannot 
be recovered. 

For comparison, an exemplary configuration based on principles of the 
20 invention is shown in FIG. 3, where equal coverage against multiple drive 
failure is achieved. This configuration provides protection against two-drive 
failure, but at a much less cost and with superior performance: requires only 7 
disk drives compared to 1 1 for traditional RAID; doesn't tax system resources 
by requiring double "writes"; has faster transfer rates; requires less 
25 administrative overhead; if these disk drives cost $1,000 each, for example, the 
inventive apparatus saves $4,000 while providing better insurance, since any 
two random drives can fail and the system will continue to properly function. 

The a m=r case of the inventive techniques may be viewed as a functional 
equivalent of a RAID-5 array with an efficiency advantage. With the inventive 



44 



BNSDOCID: <WO. 



0110043A1_I_> 



WO 01/10043 PCT/US00/20398 

techniques, both data and coding are placed on all of the disks. When no disks 
have failed, the inventive techniques may make slightly better use of the disk 
array for data reads because all n spindles of the array are utilized to retrieve 
clear data with no need for decoding. 
5 An additional performance advantage of the inventive techniques is 

realized by converting passive a hot spares" into active contributors to the disk 
array. All running disks can be fully employed, which means complete 
utilization of all spindles for better writing and reading efficiency. This also 
prevents the problems of hot spares possessing latent defects, defects that 

10 might not become known until the hot spare is actively used in the array. 

In a traditional RAID-1 (or 0+1,5+1, etc.) storage configuration, with 
data mirrored on two independent SCSI channels, all data could be lost in one 
channel and operation would continue. However, if more than one drive failure 
concurrently occurs in both mirrored channels, then the entire storage system . 

15 becomes inoperable. With a storage system according to the invention, on the 
other hand, multiple drive failures are sustainable. 

As described, the inventive aspects are based on a pattern with the 
matrix designation n.m, where n>m>0, and n is the total disk count and m is 
the number of disks that can be lost. According to one aspect of the invention, 

20 any combination of up to m disks can be lost and the amount of data stored is 
equal to n-m, which is the theoretical maximum when m disks are lost. The 
read speed is n and the write speed is n-m, which are also theoretical maxima. 
Using some of these performance values, the following table compares the prior 
art with some inventive embodiments. 

25 





A 


H 


H 


B 


H 


H* 


C 


H 


H 


Case 


0+1 


10.2 


10.5 


5+1 


10.3 


10.6 


5+5 


9.3 


9.5 


Safe Loss Count 


1 


2 


5 


3 


3 


6 


3 


3 


5 
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Max Loss Count 


5 


2 


5 


6 


3 


6 


5 


3 


5 


Data Capacity 


5 


8 


5 


4 


7 


4 


4 


6 


4 


Read Speed 


#5 


10 


10 


#5 


10 


10 


' #3 


9 


9 


Write Speed 


5 


8 


5 


4 


7 


4 


4 


6 


4 



# These speeds can be improved on certain specialized operating systems. 



Other Related Methods, Apparatus, and Applications. FIG. 19 is a block 
5 diagram of a communication system 1900. In this embodiment, 

communication system 1900 is a radio frequency (RF) communication system 
providing for wireless communications. Communication system 1900 includes 
a communication device 1902 having an antenna 1904, and a communication 
device 1906 having an antenna 1906. Communication devices 1902 and 1906 

10 can be portable or non-portable electronic devices, such as portable 

telephones, personal digital assistants (PDAs), computers (desktop or laptop), 
fixed base stations, etc. 

As shown in FIG. 20, communication device 1902 is shown to have an 
RF transceiver 2002 (an RF receiver and transmitter) coupled to antenna 1904, 

15 a memory 2004, a processor 2006 (or processors), and an input/ output (I/O) 
device(s) 2008. Processor 2006 may be or include one or more 
microcontrollers, microprocessors, digital signal processors, etc. I/O device 
2008 may be or include memory for storing and retrieving data, a speaker 
and/ or microphone for audio communication, a display for text or video data, 

20 etc. 

Generally, communication device 1902 embodies similar features as 
described above in relation to Wiencko coding and operates accordingly, and 
otherwise operates in a conventional manner using conventional components. 
The inventive components of communication device 1902 may include software 
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and/ or hardware to implement the redundancy generation and recovery 
functions as described above. For example, communication device 902 may 
include a PGA as described above, or software stored in memory 2004 for 
execution by processor(s) 2006. The variables m and n may be any suitable 
5 numbers, and may remain fixed during operation. 

FIG. 21 is a representation of a data format in communication system 
1900 of FIG. 19. Variables m, n, H, Q, and D are represented and are defined 
similarly as above. A plurality of channels are shown, channels 1,2, and 3 or 
channels 2102, 2104, and 2106, respectively. The channels may be formed 

10 using frequency division, time division, code division, etc., techniques. Since 
there are three channels in this example, n=3. Here, data from all three 
channels combine to form the user data desired. For example, a grouping 
2108 from each of channels 2102, 2104, and 2106 form what would be a stripe 
set in a disk array. Similarly, a grouping 2110 forms what would be the next 

15 stripe set in the disk array. 

FIG. 22 is the same data format as shown in FIG. 21, but where it is 
established that m=2 (as an example) and that particular data groupings are 
unavailable (due to, e.g., data errors). The unavailability is represented by an 
"X n over the data that are unavailable. In accordance with the present 

20 invention, user data are recoverable when any combination of two channel 
failures occur (m=2). For data grouping 2108 in FIG. 22, the user data is 
recoverable since only a single a D n from channel two is unavailable. For the 
next data grouping in FIG. 22, data grouping 21 10, the user data are 
recoverable as well since only two "D"s are unavailable (data from channels two 

25 and three). For the next data grouping in FIG. 22, the user data are partially 
recoverable since three a D"s from all three channels are unavailable (> m), 
where some data are "clear" bits. Finally, for the next data grouping in FIG. 
22, the user data are recoverable since only two U D"& are unavailable (data from 
channels one and two). Good performance preferably involves a system in 



47 

BNSDOCID: <WO 0110043A1_L> 



WO 01/10043 PCT/US00/20398 

which the channel demarcations are chosen such that errors tend to 
concentrate in a few channels rather than be spread across many channels. 

FIG. 23 is another example of a data format which may be utilized in 
connection with communication system 1900 of FIG. 19. As an example to 
5 illustrate this format, a data grouping 2302 represents what would otherwise 
be a stripe set in a disk array. Similarly, a data grouping 2304 represents what 
would otherwise be the next stripe set in the array. Referring to FIG. 24, where 
m=2 and n=3, the user data is recoverable in both situations shown since no 
more than two "time slots" in each data grouping are unavailable. 

10 FIG. 24. 1 is a flowchart describing a method of processing data in a 

communication device and system, such as that described in relation to FIGs. 
19-24. At one or more remote stations, RF signals are modulated with the data 
and transmission is provided over a plurality of n channels. Beginning at a 
start block 2400, the modulated RF signals are received and demodulated (step 

15 2401). The data is received in n data groupings (step 2402), some or all of 
which may be adversely affected with errors. For each data grouping, errors 
are detected and corrected to the extent possible using conventional error 
detection and correction techniques (e.g., with suitable conventional error 
detection and correction codes). Some data groupings, however, may have 

20 errors that are not correctable with use of such conventional techniques. 

If data recovery is then necessary (step 2406), the data is recovered using 
a data recovery function(s) (step 2408). The appropriate data recovery 
function(s) is selected based on which data groupings are in error, even after 
application of conventional techniques. If no data recovery is necessary at step 

25 2406, or after data recovery is performed at step 2408, the data is processed 
(step 2410). Processing the data in step 2410 may involve processing a 
predetermined function based on the data (e.g., for command or control data) 
or processing the data for use with an I/O device (e.g., for audio, text, or video 
data). Other suitable error detection techniques may be used in relation to 
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steps 2404 and 2406 (e.g., low signal strength indications, low bit error rates 
(BER), Wiencko code error detection, etc.). 

Communication system 1900 may also operate to provide data 
redundancy adjustments similar to that provided and described in relation to 
5 disk arrays. Redundancy adjustability may be provided — even dynamically 
during device operations. For example, communication system 1900 operates 
such that more redundancy is provided (i.e., m is incremented) in response to 
the detection of inadequate communication, and less redundancy is provided 
(i.e., m is decremented) otherwise. Also preferably, more redundancy is 

10 provided in response to less system capacity made available (due to, e.g., other 
users), and less redundancy is provided in response to more system capacity 
made available. The control of such adjustments may be made by the base or 
mobile station, or both. 

Other inventive aspects described herein involve local area networks 

15 (LANs). FIG. 25 is a block diagram illustrating such an environment. A LAN 
2500 includes a department 2504 connected to a switch 2506, a department 
2508 connected to a switch 2510, and a server 2512. A switch controller 2502 
has a number of ports which are connected to switches 2506, 2510 and server 
2512. More particularly here, switch controller has twelve ports, the first four 

20 of which are connected to switch 2506, the second four of which are connected 
to switch 2510, and the last four of which are connected to server 2512. 

Generally, the LAN and the entire system in FIG. 25 embody similar 
features as described above in relation to Wiencko coding and operate 
accordingly, but otherwise operate in a conventional manner using 

25 conventional components. The inventive components may include software 
and/ or hardware to implement the redundancy and recovery functions, as 
described above. For example, switches 2506, 2510 and server 2512, as well 
as switch controller 2502, may include a PGA as described above, or software 
stored in memory for execution by a processor. As shown in FIG. 25, the 
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system is configured for the situation where m=2 and n=4. The variables m 
and n may be any suitable numbers, and may remain fixed during operation. 
Redundancy adjustments may be provided as well, similar to that described 
above. 

5 Other inventive aspects described herein involve wide area networks 

(WANs). FIG. 26 is a block diagram illustrating such an environment. As 
shown, a WAN 2600 includes a switch controller 2602 connected to one or 
more telephone companies (as shown, AT&T, MCI, and Sprint). More 
particularly, switch controller 2602 has twelve ports, the first four of which are 

10 connected to AT&T, the second four of which are connected to MCI, and the 
last four of which are connected to Sprint. 

Generally, the WAN and the entire system in FIG. 26 embodies similar 
features described above in relation to Wiencko coding and operates 
accordingly, and otherwise operates in a conventional manner using 

15 conventional components. The inventive components may include software 
and/or hardware to implement the redundancy and recovery functions, as 
described above. For example, switch controller 2602 and the service 
companies, may utilize a PGA as described above, or software stored in 
memory for execution by a processor. As shown in FIG. 25, the system is 

20 configured for the situation where m=8 and n=12. The variables m and n may 
be any suitable numbers, and may remain fixed during operation. 
Redundancy adjustments may be provided as well, similar to that described 
above. 

Other inventive aspects described herein involve multiple servers, or 
25 multiple servers and the Internet. FIG. 27 is a diagram illustrating such an 
environment. A system 2710 includes a plurality of servers 2712, such as 
servers 2702, 2704, and 2706. Servers 2712 may be connected via the 
Internet 2700. An end user 2708 may connect to the Internet 2700 to access 
data from one or more of servers 2712. 



50 



BNSDOCID: <WO 01 10043A1_I_> 



WO 01/10043 PCT/US00/20398 

Generally, system 2710 embodies similar features as described above in 
relation to Wiencko coding and operates accordingly, and otherwise operates in 
a conventional manner using conventional components. The data is spread 
over multiple servers in a manner similar to that described in relation to a disk 
5 array. The inventive components may include software and/ or hardware to 
implement the redundancy and recovery functions, as described above. For 
example, servers 2712 and user 2708 may operate in connection with a PGA as 
described above, or software stored in memory for execution by a processor. 
The variables m and n may be any suitable numbers, and may remain fixed 

10 during operation. Redundancy adjustments may be provided as well, similar to 
that described above. 

In an alternate embodiment, system 2710 may operate using a "temporal 
proximity* technique. The data is spread over the multiple servers in a manner 
similar to that described in relation to a disk array. To illustrate, suppose m=2 

15 and n=3, and that servers 2712 are geographically separated by some large 
distance. For example, server 2702 (designated "A") is located in California , 
server 2704 (designated "B") is located in Chicago, Illinois, and server 2706 
(designated a C) is located in New York. At different times of the day, servers 
2712 are loaded differently. Therefore, when user 2708 requests data from any 

20 one of servers 2712, the time it takes to retrieve that data depends upon which 
server is accessed. Exemplary differences in data access time from servers 
2712 are summarized in the tables of FIGs. 28, 29, and 30. As shown in these 
tables, data access time is best from server 2702 at 7:30 a.m., from server 
2704 at 8:30 a.m., and from server 2706 at 1 1:30 a.m. 

25 In operation, user 2708 sends a request for some information (e.g., a file) 

from each one of servers 2712. In response to the requests, servers 2712 
operate in a parallel fashion to submit data from the file to user 2708. 
Eventually, one of servers 2702 will be the first to supply user 2708 with the 
data (i.e., packet or group of packets). Once user 2708 receives the data from 
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the first "winning" server, it ignores the other data submitted by the "losing" 
server(s). Since m=2 and n=3, user 2708 can afford to ignore this later sent 
data and recover the data using the techniques described herein. This method 
repeats for each packet or group of packets. Thus, in this embodiment, data is 
5 always made "available" from a single server. 

FIG. 3 1 is a flowchart describing a method of processing data in 
accordance with such temporal proximity techniques. Assume m=n-l. 
Beginning at a start block 3100, a data request is submitted to n servers (step 
3102). A response is received from one of n servers that is first in time (step 

10 3104). The data is constructed using a Weincko data recovery function and 
that received from the first responding server (step 3106). The appropriate 
data recovery function is selected based on which server first responded. 
Responses from the other n- 1 servers that are later in time may be ignored. If 
it is determined that all data have been received (step 3108), the flowchart ends 

15 at a finish block 3110 (but the method may repeat for subsequent requests). If 
all of the data have not been received, the method repeats at step 3104. This 
technique may be used in connection with values of m that are less than n-1 as 
well, where the first (n-m) responses are used to construct the data. 

In another embodiment related to FIG. 31, step 3104 receives a response 

20 that is both first-in-time and error-free (based on the use conventional error 
detection and/or correction techniques). Here, for example, the first-in-time 
response may have non-correctable errors, but the second-in-time response 
may have no errors and therefore be used to recover the data in its entirety. 

25 Further Related Methods, Apparatus, and Applications. Parity in 

memory and low level storage is intimately connected with low-level hardware 
and firmware so as to be transparent and not costly from the point of view of 
all data using applications including drivers. It is usually accomplished on the 
bit-level before the data is assembled into bytes or words. Though closely 
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r 

related to RAID, there are some differences: RAID must be able to operate as 
an overlay on any disks (or other storage devices) independent of the hardware, 
firmware or memory structure. It may also handle data in comparatively large 
blocks to avoid data access inefficiency, or in small blocks to avoid latency. 
5 File and system copying and backup has a relationship with mirroring 

and RAID-1 because a copy is made. However, RAID must be available at all 
times to recover on-the-fly from device loss. Backups are periodic in order to 
avoid overburdening the system: they therefore can return data that is out-of- 
date. But what they do return is unlimited by the nature of the failure which 

10 can be total, unlike RAID. 

Similar methods and apparatus may also be utilized to create a real-time 
backup capability that works in a fundamentally different way than present 
backup capabilities. Present "tape backup" schemes are an all-or-nothing 
proposition, and fail to accommodate the cases where a real-time backup, but 

15 not complete backup, is what fits the application. This application extends the 
RAID field in the system beyond that of the homogeneous disks. As an 
example, consider an 18-disk disk array, along with 5 additional storage 
elements, say 5 tape drives. The encoding may be setup for a "23/7* 
configuration, so that the equivalent of 16 disks worth of information is being 

20 stored. Here, the disk array's information alone would be sufficient to read all 
data, including if up to two disks failed in the disk array. If up to five 
additional disks failed, then data is still recoverable with a little help from the 
tape drives. 

In the exemplary case, the risk of losing all data if more than seven disks 
25 fail may be acceptable, but the backup capability may need to be updated in 
real-time. A downside to this approach is that decode is always necessary, 
because access to the slower tape drives when at least sixteen disks are 
available is never desired, and decode must be performed for five elements even 
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when all eighteen disks are available. However, this downside can be 
ameliorated using fast-acting decode logic. 

The example above applies techniques to disks and tape. However, the 
techniques may be applied to faster disks and slower disks, or a number of 
5 levels of faster, fast, moderate, slow, slower storage technologies, whether they 
be semiconductor, rotating media or streaming media. The parallel-element 
redundancy techniques of the present invention can be applied to systems 
employing a mixture of these media. 
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WIENCKQ CODES AND FURTHER METHODS TO OBTAIN ADDITIONAL ' 



CODES. 

A. Exemplary Weincko Codes, With code searching methods, encoding 
5 matrices have been found for many pairs of n and m. APPENDIX A below 

provides an exemplary list of currently available Wiencko codes. Since multiple 
instances of Wiencko codes exist for many pairs of m and n, those with the 
most favorable implementation characteristics (smaller data and code chunk 
sizes, sparseness of encoding matrix) have been selected for application. 

10 

B. Code Determining /Validation Method. Further description is 
provided as source code in APPENDIX B. 

C. Further Methods. Techniques have been developed to identify 

15 encoding matrices for additional pairs of m and n, especially for higher values 
thereof, and are provided as source code in APPENDIX C. 

D. Duality Methods. These are described in APPENDIX D along with 
source code associated therewith. 

20 

E. Wiencko Codes Where m=l and m=(n-lh The m=l and m=(n-l) cases 
are shown in the code list in APPENDIX A, although it is clear at a glance that 
they are all essentially the same. It can be mathematically proven that the 
patterns shown will always work for m=l and m=(n-l), for any value of n>=2. 

25 The following are the proofs using n=5 cases as examples. 

Example: 

5-141 
1000 - 
30 0100 - 
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0010 - 
0001 - 

The general solution in the m=l case shows an (n-l)x(n-l) identity matrix 
5 following the numbers nln-1 1 thus for p= 1 . The actual parity matrix is 
expanded from this by adding a row of zeroes on the top then rotating and 
copying this (n-l)xn column to the right until an (n*(n-l))xn parity matrix 
exists. There is only one case to test - removal of the first disk (all other 1 disk 
removals are rotations of this). The resulting square matrix (parity rows from 
10 the last n-1 disks, data columns from the first disk) is an identity matrix, and 
is therefore invertible. 

Example: 

5 4 14 
15 0 - 
0 
0 
1 

0 - 
20 0 
1 
0 

0 - 

1 

25 0 
0 

1 - 
0 

0 

30 0 

The general solution in the m=n- 1 case consists of a column of n- 1 
columns, each of which has n-1 entries. The k-th little column (counting from 
1) has the n-k-th value (counting from 1) equal to 1, the others O. To get the 
35 actual parity matrix, as above, you add a little column on top and then copy to 
the right, rotating each time. As above, there is essentially only one test; 
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removing all the disks but the first. The square matrix uses data columns from 
the last (n-1) disks and parity rows from the first disk. The result is an identity 
matrix, which is always invertible. 

5 Conclusion. As readily apparent, the various inventive aspects described 

herein provide several advantages in simplicity and constitutes a breakthrough 
and a technological leap in the arts, as in the fields of data redundancy 
generation and recovery, data retrieval and storage, data communications, and 
network operations. The present invention is a particular breakthrough in the 
10 field of RAID architecture, which has long used rather limited techniques. 
Thus, the scope of all of the claimed inventions should be understood to be 
quite broad and warrant a broad range of equivalent structures and 
func tionali tie s . 
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APPENDIX A 



List of representative Wiencko codes. Mulitple solutions may exist for a given 
m and n, and thus the codes below are merely preferred examples. 

The first line associated with each set has four numbers; the first is n, the 



second is m, the third is H, and the fourth is Q (Q = D-H). It follows that 
p=gcd(H, Q). 



4 13 1 
100 - 
010 - 
001 - 

4 2 11 

1 - 

1 - 

0 - 

4 3 13 
0 - 

0 
1 

0 - 

1 

0 

1 - 

0 
0 

5 14 1 
1000 - 
0100 - 
0010 - 
0001 - 

5 2 3 2 
010 - 
100 
100 - 
001 
010 - 
001 
000 - 
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000 

5 3 2 3 
00 - 
01 
10 

10 - 

00 

01 

10 - 

01 

00 

00 - 

00 

00 

5 4 14 
0 - 

0 
0 

1 

0 - 
0 

1 

0 

0 - 

1 

0 
0 

1 - 

0 
0 
0 

6 15 1 
10000 - 
01000 - 
00100 - 
00010 - 
00001 - 

6 2 2 1 
10 - 
10 - 
01 - 

00 - 

01 - 

6 3 11 

0 - 

1 - 
1 - 
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1 - 
0 - 



6 
0 
1 
0 
1 
1 
0 
0 
0 

1 

0 



4 12 



6 5 15 

0 - 

0 

0 

0 

1 

0 - 

0 

0 

1 

0 

0 - 
0 

1 

0 
0 

0 - 

1 

0 
0 
0 

1 - 

0 
0 
0 
0 



7 16 1 
100000 - 
010000 - 
001000 - 
000100 - 
000010 - 
000001 - 

7 2 5 2 
10000 - 



A-l 
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01000 
00100 - 
10000 
01000 - 
00001 
00100 - 
00010 
00010 - 
00001 
00000 - 
00000 



7 3 4 3 

1000 - 

0100 

0010 

0001 - 

1000 

0100 

1000 - 

0010 

0100 

0000 - 
0010 
0001 

0001 - 
0000 
0000 
0000 - 
0000 
0000 



7 4 
000 
100 
010 
001 
001 
000 
100 
010 
000 
010 
100 
001 
100 
010 
000 
000 
001 
000 
000 



3 4 



4-4 
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000 
000 
000 
000 
000 



7 5 2 5 

00 - 

00 

00 

10 

01 

00 - 

00 

01 

00 

10 

10 - 

00 

00 

01 

00 

00 - 

10 

01 

00 

00 

10 - 

01 

00 

00 

00 

00 - 

00 

00 

00 

00 



6 16 
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0 
0 

1 

0 
0 

0 - 
0 

1 

0 
0 
0 

0 - 

1 

0 
0 
0 
0 

1 - 

0 
0 
0 
0 
0 



8 17 1 

1000000 - 

0100000 - 

0010000 - 

0001000 - 

0000100 - 

0000010 - 

0000001 - 



8 2 3 1 
Oil - 
100 - 
100 - 
010 - 
001 - 

000 - 

001 - 



8 3 5 3 

00000 - 

00001 

00100 

01000 - 

10000 

00100 

10000 - 

01000 

00010 
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10000 - 

00010 

01000 

00100 - 

00010 

00001 

00001 - 

00000 

00000 

00000 - 

00000 

00000 

8 4 2 2 
10 - 

pi 

10 - 
01 

00 - 
10 

10 - 
01 

00 - 
00 

01 - 
00 

00 - 
00 

8 5 3 5 

010 - 

000 

100 

000 

000 

000 - 

000 

100 

001 

010 

000 - 

100 

000 

010 

001 

000 - 

010 • 

000 

100 

001 

100 - 

010 

A-4 
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001 

000 

000 

001 - 

000 

000 

000 

000 

000 - 

000 

000 

000 

000 



8 

1 
1 

0 
0 
0 

1 

0 
0 

1 

0 

1 

0 

1 

0 
0 
0 
0 
0 

1 

0 
0 



6 13 



8 
0 
0 
0 
0 
0 
0 

1 

0 
0 
0 
0 
0 

1 



7 17 



A-8 
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/*{{{ includes*/ 

/* #include <sys/types .h> 

#include <sys/stat.h> 

#include <fcntl.h> 

ftinclude <termios.h> */ 
#include <stdio . h> 
#include <stdlib.h> 
/*}}} */ 
/*{{{ defines*/ 

tfdefine DEBUG 1 
ttdefine MAX_JDIM 256 
#define MAX_IDIM 3 06 
#define MAXJLENGTH 8192 
#define MAX__COUNT 1024 
#define FILENAME "file. joe" 

/*}}} */ 

main(int argc, char *argv[]) 
{ 

FILE * stream; 

char textline [512] , filename [128] , rotated [10] ; 
char joefile[MAX_JDIM] [MAX_IDIM] ; 
char jumps [400] [18] ; 

unsigned long tests [32] , testmatrix [32] , testm2 [32] , testm3 [32] ; 
unsigned long long ltests[64] , ltestmatrix [64] , ltestm2 [64] , ltestm3 [64] ; 
int n , m, D , P , datal ines , i , j , x , y , z , ndeads , al i ve , ndim , mtemp , mf lip ; 
/*{{{ steptobit and bittostep*/ 
/* 

* Use global n. 

* bitmatrix bits has 1 in position k if k-th disk is dead. 

* char steps [i] is a step from a bit to a next bit. 

* bits = steptobit (steps) 

* nbits = bittostep (steps, bits) 

* invert each other. Rotations are treated as equivalent. 
*/ 

int steptobit (const char steps []) { 
int test, ind, res; 
test = 1; 
res c 0; . 
ind = 0; 

while (steps [ind]) { 
res |= test; 

test <<= ( (int) steps [ind] ) ; 
ind++ ; 

} 

return res; 

int bittostep (int bits, char steps [] ) { 

int test, ind, shift, shiftlast, shiftfirst; 
test 1; 
ind = 0 ; 
shiftlast = n; 
shiftfirst = -1; 

for (shift«=0;shif t<n;shif t++) { 
if (bits&test) { 

if (shiftlast<shift) { 

steps [ind] «= (char) (shif t-shif tlast) ; 
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ind++; 
} else { 

shiftfirst = shift; 

} 

shiftlast « shift; 

} 

test <<= 1; 

if (shiftfirst 0) { 

steps [ind] = (char) (shiftfirst + n - shiftlast); 
ind++; 

} 

steps [ind] = 1 \0 1 ; 
return ind; 

} 

/*>}} */ , 

/*{{{ void showmatrix (comment , matrix) - use ndim for dimension.*/ 
void showmatrix (char comment [], unsigned long matrix []) { 
int j,i; 

char matints [33]; 
printf ( "%s : \n" , comment) ; 
mat int s [ndim] = ' \0 1 ; 
for (j=0; j<ndim; j++) { 

unsigned long murk = matrix [j]; 
unsigned long imask = 1; 
for (i=0 ; icndim; i++) { 

if (murk&imask) {matints [i] = • |'; 
} else {matints [i] = 'O 1 ;} 
imask <<= 1; 

} 

printf (" %s\n" , matints) ; 

} 

} 

/*}}} */ 

/*{{{ void showmatrix64 (comment, matrix) - use ndim for dimension.*/ 
void showmatrix64 (char comment [] , unsigned long long matrix []) { 
int j,i; 

char matints [65] ; 
printf ( ,, %s: \n n , comment) ; 
matints [ndim] « '\0 % ; 
for (j=0; j<ndim; j++) { 

unsigned long long murk = matrix [j] ; 
unsigned long long imask = 1; 
for (i«0;i<ndim;i++) { 

if ( (murk&imask) !-(long long)0) {matints[i] = • | ' ; 
} else {matints [i] = •O';} 
imask <<= 1; 

printf (" %s\n" , matints) ; 

} 

/*}}} */ 

/*{{{ int mulO multiplies fl by f2 to get p saving i (bits) dim of f2*/ 
int mul (unsigned long fl[], unsigned long f 2 [] , unsigned long p[], 

int downfl, int jdim) { 

int k,j; 

for (k«=6;k<downf l;k++) { 



104 



BNSDOCID: <WO 0110043A1J_> 



WO 01/10043 



PCT/US00/20398 



unsigned long nurp = f 1 [k] ; 
unsigned long temp = 0; 
for ( j«=0; j<jdim; j++) { 

if (nurp&tests [j] ) temp A = £2[j]; 

} 

p[k] = temp; 

} 

} 

/*}}} */ 

/*{{{ int mul64() multiplies lfl by If 2 to get lp caving bits dim of If 2*/ 
int mul64 (unsigned long long fl[], unsigned long long f2 [] , 

unsigned long long p[], int downfl, int jdim) { 

int k,j; 

for (k=0;k<downf l;k++) { 

unsigned long long nurp = f 1 [k] ; 
unsigned long long temp = 0; 
for (j=0; j<jdim; j++) { 

if (nurp&ltests [j] ) temp A = f2[j]; 

} 

p[k] = temp; 

} 

} 

/*}}} */ 

/*{{{ int det() tests testmatrix to ndim<=32*/ 
int det (void) { 

int i, j, k, ihit; 
for (j=0; j<ndim; j++) { 

ihit = -1; /* no hit yet */ 
for (i=j ; i<ndim; i++) { 

if (testmatrix [i] &tests [j] ) { 
if (ihit<0) { 

ihit = i; 
} else { 

testmatrix ti] A = testmatrix [ihit] ; 

} 

} 

} 

if (ihit<0) return 0; 
if (ihit>j) { 

unsigned long temp; 

temp = testmatrix [ihit] ; 

testmatrix [ihit] m testmatrix I j ] ; 

testmatrixlj] = temp; 

} 

} 

return 1; 

} 

/*}}} */ 

/*{{{ int det64() tests ltestmatrix to ndim<=64*/ 
int det64 (void) { 
int i, j, k, ihit; 
for (j=0; j<ndim; j++) { 

ihit « -1; /* no hit yet */ 
for (i«j ; i<ndim; i++) { 

if (ltestmatrix [i] fcltests [j] ) { 
if (ihit<0) { 
ihit - i; 
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} else { 

ltestmatrix [i] A = ltestmatrix [ihit] ; 

} 

} 

} 

if (ihit<0) return 0; 
if (ihit>j) { 

unsigned long long temp; 

temp = ltestmatrix [ihit] ; 

ltestmatrix [ihit] = ltestmatrix [j ] ; 

ltestmatrix [j ] = temp; 

) 

} 

return 1 ; 

} 

/*}}} */ 

/*{{{ int inv() inverts testmatrix to testm2 then checks; ndim<=32*/ 
int inv(void) { 

int i, j, k, ihit; 

/* create a unit matrix */ 

memcpy((void *)testm2, (void *)tests, 128); 
#if DEBUG 

showmatrix ( M Before inversion" , testmatrix) ; 
#endif 

memcpy((void *)testm3, (void *) testmatrix, 128); 
for (j=0; j<ndim; j++) { 

ihit = -1; /* no hit yet */ 
for (i= j ; i<ndim; i++) { 

if (testmatrix [i] fctests [j ] ) { 
if (ihit<0) { 

ihit = i; 
} else { 

testmatrix [i] A = testmatrix [ihit] ; 
testm2 [i] testm2 [ihit] ; 

} 

} 

} 

if (ihit<0) return 0; 
if (ihit>j) { 

unsigned long temp; 

temp = testmatrix [ihit] ; 

testmatrix [ihit] « testmatrix [j ] ; 

testmatrix [j ] «= temp; 

temp = testm2 [ihit] ; 

testm2 [ihit] = testm2 [j] ; 

testm2[j] = temp; 

} 

for (i=0; i< j ; i++) { 

if (testmatrix [i] fctests [j] ) { 
testmatrix [i] A = testmatrix [j ] ; 
testm2[i] A = testm2 [j] ; 

} 



memcpy((void *) testmatrix, (void *)testm3, 128); 
#if DEBUG 

showmatrix ( "After inversion" , testm2) ; 
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#endif 

/* now matmult testm2 by testmatrix and eee that it is unitmatrix */ 
mul (testm2 , testmatrix, testm3 , ndim, ndim) ; 
#if DEBUG 

showmatrix (" Product" , testm3) ; 
#endif 

for (j=0; j<ndim; j++) { 

if (testm3 [j] ! =tests [j] ) return 0; 

} 

return 1; 

} 

/*}}} */ 

/*{{{ int inv64() inverts ltestmatrix to ltestm2 then checks; ndim<=64*/ 
int inv64 (void) { 

int i, j, k, ihit; 

/* create a unit matrix */ 

memcpy((void *)ltestm2, (void *)ltests, 512); 
#if DEBUG 

showmatrix64 ( "Before inversion" , ltestmatrix) ; 
#endif 

memcpy((void *)ltestm3, (void *) ltestmatrix, 512); 
for (j=0;j<ndim; j++) { 

ihit o -1; /* no hit yet */ 
for (i=j ;i<ndim;i++) { 

if (ltestmatrix [i] fcltests [j] ) { 
if (ihit<0) { 

ihit = i; 
} else { 

ltestmatrix [i] A = ltestmatrix [ihit] ; 
ltestm2 [i] A = ltestm2 [ihit] ; 

} 

} 

} 

if (ihit<0) return 0; 
if (ihit>j) { 

unsigned long long temp; 

temp = ltestmatrix [ihit] ; 

ltestmatrix [ihit] « ltestmatrix [j ] ; 

ltestmatrix [j] = temp; 

temp = ltestm2 [ihit] ; 

ltestm2 [ihit] = ltestm2[j]; 

ltestm2[j] *= temp; 

for (i=0;i<j ;i++) { 

if (ltestmatrix [i] fcltests [j] ) { 
ltestmatrix [i] ltestmatrix [j ] ; 
ltestm2 [i] A = ltestm2 [ j ] ; 

} 

} 

memcpy((void *) ltestmatrix, (void *)ltestm3, 512); 
#if DEBUG 

showmatrix64 ("After inversion", ltestm2) ; 
#endif 

/* now matmult ltestm2 by ltestmatrix and see that it is unitmatrix */ 
mul64 (ltestm2, ltestmatrix, ltestm3 , ndim, ndim) ; 
#if DEBUG 
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showmatrix64 ( "Product ",ltestm3 ) ; 
# end if 

for (j=0; j<ndim; j++) { 

if (ltestm3 (j J l«ltests [j J ) return 0; 

} 

return 1; 

} 

/*}}} */ 

/*{{{ void testemO tests various matrices*/ 
void testem(void) { 
do { 

printf ("Enter test matrix in hex X X X X:"); 
scanf("%lX %1X %1X %1X" , fctestmatrix [0] , fctestmatrix (1] , 

fctestmatrix [2] , ^testmatrix [3] ) ; 
ndim = 4; 

printf ("%1X %1X %1X %1X: det was %d\n", 

testmatrix[0] , testmatrix [1] , testmatrix [2] , testmatrix [3] ,det() ) ; 
} while (testmatrix [0] ) ; 

} 

/*}}} */ ' 

/* START OF THE STATEMENTS */ 

if (argol) { 

strcpy (filename ,argv[l] ) ; 
} else { 

strcpy (filename , FILENAME) ; 

f or (i=0 ; i<32 ; i++) { 
tests [i] = !<<i; 

for ( i=0 ; i<64 ; i++) { 

ltests[i] = ((long long)l)<<i; 
ltestmatrix[i) = ((long long) (-1) ) «i; 

} 

ndim =0; 

printf ("\n\nTEST PROGRAM validate READS %s . . . \n\n\n" , filename); 
alive = 1; 

/*{{{ build the D to P matrix*/ 

stream = f open ( filename, "r") ; 
if (stream==NULL) { 

printf ("%s does not exist . \n" , filename) ; 

exit(l) ; 

printf ("%s exists. Reading in its data . \n", filename) ; 
if (fgets (textline, 511, stream) !=NULL) { 

sscanf (textline, "%d %d %d %d" , &n, &m, &D, &P) ; 

printf ("n=%d,m=%d,D=%d,P=%d\n",n,m,D,P) ; 

if (n&&m&&D&tP&&: (n>m) && ( (n*P) « ( (D+P) *m) ) ) { 
printf ("Confirmed correct ratio. \n"); 

} else {printf ("Error in ratio or zero entry . \n" ); exit (1) ; } 

} 

mtemp «= n-m; 

if ( (n>18) | | ( (m>5) &&(mtemp>5) ) | | ( (n*P) >MAX_JDIM) | | ( (n*D) >MAX_IDIM) ) { 
printf ("Horsefeathers! Too big for our buffers! ixj «%dx%d\n« , n*D, n*P) ; 
if ( (n>16) | | ( (m>5)&&(mtemp>5) ) ) exit(l); 
alive =0; 

} 
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if (alive) { 

/*{{{ read in and check the data*/ 

memset (joefile, <int)'0», MAX_JDIM*MAX_IDIM) ; 

datalines = 0; 

memset (text line, (int) 1 7 1 , D) ; 
while (fgete (textline, 511, stream) !=NULL) { 
printf (">>>%s" , textline) ; 
for (x«0;x<D;x++) { 

if ((((int)textline[x])-48)>>l) { 

printf ("Illegal character in data line %d . \n" , datalines) ; 
alive = 0; 

} 

} 

memcpy (& joefile [datalines+P] [0] , textline , D) ; 

memset (textline,. (int) # 7 1 ,D) ; 

datalines++; 

} 

printf ("Total of %d lines of data. datalines ) ; 

if (datalines==( (n-l)*P) ) {printf (" Correct .\n") ; 

} else {printf C Should have been %d. \n" , (n-1) *P) , alive = 0,-} 

/*}}} */ 

} 

f close (stream) ; 
if (alive) { 

/*{{{ fill up matrix and print it.*/ 

for (i=D;i< (n*D) ;i+=D) { 
x s i - D; 

for (j=P; j< (n*P) ;j++) memcpy (tjoefile [j ] [i] , fcjoef ile [ j -P] [x] ,D) ; 

y = (n-l)*P; 

for (j=0; j<P; j++) { 

memcpy (&joef ile [j] [i] , &joef ile [y] [x] ,D) ; 

y++; 

} 

} 

textline [n*D] = «\0 f ; 
for (j=0; j<(n*P) ; j++) { 

memcpy (textline, fcjoef ile [j ] [0] ,n*D) ; 

printf ( " %s\n M , textline) ; 

} 

/*}}} •/ 

} 

/*}}} */ 

/*{{{ find all possible essentially different cases of deads*/ 
mflip =0; 

if ( (m>5)&&(mtemp<6) ) { 
mflip = m; 
m = mtemp ; 
mtemp = mflip; 

} 

if <m==l) { 

/*{{{ just 1*/ 
ndeads = 1 ; 

jumps [0] [0] = (char) n; 
jumps [0] [1J « , \0» ; 

/*}}} */ 
} else if (m==2) { 

/*{{{ n/2 of them*/ 
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ndeads = 0 ; 

for (i=l;i<«=(n/2) ;i++) { 

jumps [ndeads] (0] = (char) i; 
jumps [ndeads] [1] = (char) (n-i) ; 
jumps [ndeads] [2] « 'NO'; 
ndeads ++; 

} 

/*}}} */ 
} else if (m==3) { 

/*{{{ special case if n divisible by 3*/ 
ndeads = 0 ; 
textline [3] = 'NO'; 
for (jol; j<=(n/3) ; j++) { 
for (i=j;i<(n-j-j) ;i++) { 
textlinetO] = (char) j; 
textlinetl] = (char) i; 

textline[2] = (char) (n-i-j); /* strictly bigger than j */ 
strcpy (& jumps [ndeads] [0] , text line) ; 
ndeads ++; 

} 

if <(3*j)==n) { 

memset (textline, j , 3) ; 

strcpy (fcjumps [ndeads] [0] , textline) ; 

ndeads ++; 

} 

} 

/*}}} */ 

} else if (m==4) { 
/*{{{ m = 4*/ 
int passed; 
textline [4] = ' \ 0 ' ; 
for (x=l;x<= (n/4) ;x++) { 

for ( j=x; j< (n- (3*x) ) ; j++) { 
for (i=x; i< (n- j -x-x) ; i++) { 
y«n-x-i-j; 
textline [0] = (char) x; 
textline [1] = (char) j ; 
textline [2] = (char) i; 
textline [3] = (char) y; 
passed =1; 

/* x is the minimum and y=n-x-j-i>x. If j«x, xjiy < jiyx <iyxj */ 
/* and xjiy < yxji whether or not i==x. If j, i, and y all >x */ 
/* obviously it is unique minimum. If j >x and i==x the only */ 
/* case to test is xjiy vs iyx j . * */ 

if (i«=x) { 

if (y<j) passed = 0; 

} 

if (passed) { 

strcpy (fcjumps [ndeads] [0] , textline) ; 
ndeads++; 

} 

} } 

if (<4*x)==n) { 

memset (textline, x, 4) ; 

strcpy (Sc jumps [ndeads] [0] , textline) ; 

ndeads++; 
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} 

} 

/•}}} */ 
} else if (m==5) { 
/*{{{ m = 5*/ 
int passed; 
textline[5] = * \0 f ; 
for (x=l;x<= (n/5) ;x++) { 

for <j=oc;j< <n- (4*x) ) ; j++) { 

for (i=x;i<(n-j-(3*x) ) { 
for (y=x;y< (n- j -i-x-x) ;y++) { 
z=n-x-i-j - y ; 
textlinetO] = (char) x; 
textline[l] = (char) j ; 
textline[2] ■ (char) i; 
textline[3l = (char) y; 
textline[4] = (char) z; 
passed = 1 ; 

/* x is the minimum and z=n-x- j -i -y>x. The only overlapping */ 
/* cases are xj iyz=xxixz<xzxxi and xj iyz=xjxyz-xyzxj where the */ 
/* places held by x are equal to x and the others >x, */ 
/* Suffices to discard "i==x&&y==x&&j >x" and *"i>x&&j >x&&y==x" */ 
/* that is to say "y=c= x &&j >x" . */ 
if (y==x) { 

if (x<j) passed = 0; 

} 

if (passed) { 

strcpy (&jumps [ndeads] [0] ,textline) ; 
ndeads++; 

} 

} 

} 

} 

if ( (5*x)==n) { 

raeraset (textline,x, 5) ; 

strcpy (tjumps [ndeads] [0] , textline) ; 

ndeads++; 

} 

} 

/*}}} V 

} else { 

printf ("m too big or too small. \n M ) ; 
exit (1) ; 

} 

if (mflip) { 

int i, j, bits, errknt; 
errknt = 0 ; 

for (i=0; i<ndeads; i++) { 

bits a -steptobit ( (char * ) fcjumps li] [0] ) ; 

if (bittostep(bits # (char *)&jumps[i] [0]) 1 =mtemp) errknt++; 

printf ("%d errors flipping deads and alives . \n" , errknt ) ; 
mflip = m; 
m = mtemp; 
mtemp = mflip; 

printf <"%d distinct cases found: " , ndeads) ; 
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X = 0; 

for ( j=0; j<ndeads; j+«4) { 
printf ("\n") ; 

for(i=j; (i<(j+4) )&&(i<ndeads) ;i++) { 
printf (" %2d M , (int) jumps [i] [0] ) ; 

for (y=l;y<m;y++) printf (" . %2d" , (int) jumps (i] [y] ) ; 

} 

} 

printf ("\n") ; 
/*}}} */ 

/*{{{ test them all (jumps up to ndeads) 1*/ 
if (alive) alive = ( (m*D) <65) ; 
if (alive) { 

int success; 

int nups = n - m; 

int upinds [64] , downinds [64] ; 

int downs [18] ; 

ndim = m*D; 

success =1; 

for (y«0;y<ndeads;y++) { 

int determinant, invsuccess; 

/*{{{ downs [j] = jth disk down (0 and m at 0)*/ 

downs [0] = 0 ; 

for (i=0 ; i<m; i++) { 

downs [i+l] = downs [i] + ( (int) jumps [y] [i] ) ; 

} 

/*}}} */ . % , 

/*{{{ downinds [i] = i-th index on D side (i of matrix)*/ 

i = 0; 

for (x=0;x<m;x++) { 

int ind = D*downs [x] ; 
for (j=0; j<D; j++) { 

downinds [i] = ind + j; 

i++; 

} 

} 

/*}}} */ 

/*{{{ upinds [i] - i-th index on P side (j of matrxx) */ 
i = 0; 
j = 0; 

for (x=0 ;x<n,-x++) { 
if (x==downs [j ] ) { 

j++; 
} else { 

int ind = P*x; 
int k; 

for (k=0;k<P;k++) { 
upinds [i] = ind + k; 
i++; 

} 

} } 

if (il«=ndim) {printf ( "weird counting error\n" ) ; exit (1) ; } 
/*}}} */ 

/*{{{ generate ndim by ndim testmatrix*/ 
/* if (ndim>32) { */ 

memset ( (void *) ltestmatrix, 0, 512) ; 
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for (j=0; j<ndim; j++) { 

unsigned long long test ■ ltests[j]; 
char *local = fcjoef ile [upinds [ j ] ] [0] ; 
for (icO ; i<ndim; i++) { 

if ( local [downindsti]]" 1 !') ltestmatrix [i] |« test; 

} } 

memcpy ( (void *)ltestm2, (void *) ltestmatrix, 512); 
determinant « det64(); 
if (determinantcl) { 

int rankshort; 

rankshort = 0; 

for (i=0;i<ndim;i++) { 

if (ltestmatrix [i] == (long long)0) rankshort++; 

printf ("Rank shortage >= %d\n» , rankshort) ; 

memcpy ((void *) ltestmatrix, (void *)ltestm2, 512); 

/*}}} */ 
invsuccess = 0; 

if (determinant) { 

invsuccess = inv64(); 

} 

printf (" %2d" , downs [0] ) ; 

for (i=l;i<m;i++) printf (". %2d" , downs [i] ) ; 

printf <": returned det m %d, inverse-check = %d\n", 

determinant, invsuccess) ; 
if (determinant 1=1) success = 0; 
if (invsuccess !=1) success = 0; 

if (success) {printf ("%d,%d test SUCCESSFUL. \n" , n,m) ; 
} else {printf ( M %d,%d test FAILED . \n" , n, m) ; } 

} 

/*}}} */ 
exit (0) ; 

} 
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#include <stdio.h> 
ttinclude <stdlib.h> 

#define NMAX 20 
#define NUMMATMAX 200 

#define NUMDDMAX NMAX 
#define NUMCDMAX NMAX 

#define MATROWS I NMAX NMAX * NUMCDMAX 
tfdefine MATS I Z EMAX NMAX * NUMDDMAX 

main(argc, argv) 
int argc; 
char *argv [] ; 

^ extern int gausstestO, gencaseO, gn2fast(), gnslowO , initO, showanswO; 
extern int gauss (); 

int heartcount, heartbeat, matnow, n, m, nurndd, numcd, numt, nummat, i; 
int channelj)attern [NUMMATMAX *NMAX] , code_channel , code^division, prtinv; 
int data_division, c [ (NMAX-2 ) *NUMDDMAX] , ctot [NUMDDMAX] , gntype, gninit; 
int numnow, log 2 num [MATROWS I NMAX] , totsol, numeol, code, d, tdiv, flip; 
int save time [ MATROWS INMAX] , matsize, complete, ctottot, c cd tot [NUMCDMAX] ; 
long int num [MATROWS INMAX ] ; 

unsigned long int mat [MATSIZEMAX] , matinv [MATSIZEMAX] , totit, list [NUMMATMAX] ; 
if (argc < 6) { 

printf ( M \nCommand line arguments : \nn, m, heartbeat, # of solutions, ") ; 
printf ("getnext type, <t divisor>, <flip>, <print-inv>\n\n" ) ; 
return (0) ; 



n = atoi (argv [1] ) ; 
m = atoi (argv [2] ) ; 
heartbeat = atoi (argv [3] ) ; 
numsol = atoi (argv [4] ) ; 
gninit = atoi (argv [5] ) ; 

gntype = ((gninit == 2) ? 1 : gninit); 
tdiv = ((argc < 7) ? 1 : atoi (argv [6] )) ; 
if (tdiv < 1) tdiv = 1; 

flip = ((argc < 8) ? 0 : atoi (argv [7] )) ; 
prtinv ■ ((argc < 9) ? 0 : atoi (argv [8] )) ; 

gencase (&code, &d, tnumdd, &numcd, fcnumt, tnummat, channel_pattern, list, 



init (fcheartcount , ^heartbeat, num, &code_channel, &code_division, 
&data division, c, ctot, fcnumnow, fctotit, fctotsol, savetime, 
n, m,~numdd, numcd, fcmatsize, gntype, &ctottot, ccdtot) ; 

for (;;) { 

if ((totit == 1) (gninit == 2)) gntype = 0; 

switch (gntype) { 
case 0 : 

complete = gnslow(num, tnumnow , n, nurndd, numcd) * (totit != 0) ; 



n, m, tdiv) ; 
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break; 
case 1 : case 2 : 

complete = gn2fast(num, &code_channel , &code_division, &data_division, 

c, ctot # &numnow, log2num, savetime, n, m, numdd, numcd) ; 
if ((totit »- 0) && (gninit == 2)) gntype = 0; 
if (flip != 0) { 

num[(n - 1) * numcd - 1] = 1 ; 

num[(n - 1) * numcd - 2] =0; 

) 

break; 
case 3 : 

complete = gn3fast(num, &code_channel , &code_division, &data_division, 
c, ctot, &numnow, log2num, savetime, n, m r numdd, numcd, fcctottot, 
ccdtot) ; 

break; 

case 4 : 

complete = gn3f ast (num, &code_channel , &code_division, &data_division, 
c, ctot, &numnow / log2num, savetime, n, m, numdd, numcd, fcctottot, 
ccdtot) ; 

for (i = 0; (num[0) & (1 « i) ) i < (numdd- 1) ; i++) ; 

num [0] |= 1 << i; 

break; 

} 

if (complete) break; 
mat now = 0; 

while (matnow != nummat) { 

make mat (num, channel_jpattern, mat, matnow, n, m, numdd, numcd); 
if (gausstest (mat , matsize) == 0) matnow++; 
else break; 

} 

totit++; 

if (matnow == nummat) { 
totsol++; 

showanswd, num, totsol, totit, matnow, n, m, numdd, numcd, nummat, tdiv); 
if (prtinv != 0) for (i = 0; i < nummat; i++) { 

makemat (num, channel_pattern, mat, i, n, m, numdd, numcd); 

gauss (mat, matinv, matsize, i) ; 

} 

if (totsol == numsol) break; 

} 

if (heartcount != heartbeat) heartcount++; 
else { 

showansw(0, num, totsol, totit, matnow, n, m, numdd, numcd, nummat, tdiv); 
heartcount = 1; 

} 

} 
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/♦ Execution is complete ♦/ 

if (totsol != numsol) showansw(-l, num, totsol , totit, matnow, n, m, numdd, 

numcd, nummat, tdiv) ; 

return (0) ; 

> 

int gencase(c, d, numdd, numcd, numt, nummat, channel_pattern, list, n, m, tdiv) 
int n, m, *c, *d, ♦numdd, *numcd, *numt, ♦nummat, tdiv; 
int channel_j>attern[NUMMATMAX*NT4AX] ; 
unsigned long int list [NUMMATMAX] ; 



unsigned long int imin, inew, ishift, irot, k, nbound, nmask; 
int i, j, 1cm, missing, received; 

for (i = 1, 1cm = 1; i <= n; i++) 
if ( (lcm- (lcm/i) *i) != 0) { 

for (j = 2; (j*lcm« ( ( (j*lcm)/i) *i) ) != 0; j++); 
1cm ♦= j ; 

} 

*d = lcm/n; 
*c = *d*m/ (n-m) ; 
♦numdd = n - m; 
♦numcd = m; 

i = ((♦numdd < *numcd) ? ♦numdd : ♦numcd); 
for (j = i; j > 1; j--) { 

if ( ( (*numdd - (( ♦numdd/ j ) *j ) ) == 0 ) && 

( (♦numcd - ( (*numcd/ j ) * j ) ) == 0 ) ) { 
* numdd /= j ; 
♦numcd /= j ; 

} 

} 

♦numt = ♦d/ (♦numdd) ♦tdiv ; 
* numdd *= tdiv; 
♦numcd *= tdiv; 

printf ( w \n\nn = %i, m « %i, d = %i, c = %i, n,- m, *numdd, * numcd ) ; 
printf ( "p = %i\n M , tdiv) ; 

nbound = 1 << n; 
nmask «= nbound - 1; 
♦nummat = 0 ; 

for (i = 0; i < nbound; i++) { 

for (k = 0, inew = i; inew != 0; inew >>= 1) if (inew & 01) ++k; 
if (k == m) { 

imin = inew = i ; 
for (j = 1; j < n; j++) { 
ishift = inew << 1; 

inew = (ishift & nmask) + ((ishift & nbound) 1= 0) ; 
if (imin > inew) imin = inew; 

} 

for (j = 0; j < ♦nummat; j++) if (imin «== list[j]) break; 
if (j == ♦nummat) list [ (♦nummat) ++] = imin; 

} 

} 
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for (i = 0; i < *nummat; i++) { 
imin « list [i] ; 
received = 0; 
missing = n - m; 

for (j = 0, irot = 1 << (n-1) ; j < n; j++, irot >>« 1) 

channel_j>attern [ (i * n) + ((imin & irot) ? missing++ : received**) 1 = 

} 

printf ("\nThe following %i unique cases are to be tested:", *nummat) ; 
for (i = 0; i < *nummat; i++) { 

printf (*'\n%u> %lu: : M , (i+1) , listli]); 
for (j = 0; j < n; j++) { 

if (j == n - m) printf (" "); 

printf ("%i " , channel_pattern [ (i * n) + j]); 

} 

} 

printf ("\n\nThe first pattern generated is:"); 

} 

int init (heart count , heartbeat, num, code_channel , code_division, 
data_division, c, ctot, nutnnow, totit, totsol, savetime, 
n, m, numdd, numcd, matsize, gntype, c tot tot, ccdtot) 

int *heartcount, *heartbeat, n, m, numdd, numcd, *matsize; 

long int num{] ; 

int *code_channel, *code_division, *data_division, c[]; 

int ctot [] , *numnow, *totsol, savetime [] , gntype, *ctottot, ccdtot []; 

unsigned long int * tot it; 

{ 

int i, j; 

♦totit = 0; 
♦totsol = 0; 



*ctottot = 0; 



♦heartcount = *heartbeat; 

switch (gntype) { 

case 3 : case 4 : 

♦numnow = ( (n - 1) * numcd) - 1 ; 
*data_division = -2; 
*code_channel = n - 2 ; 
break; 



default: 

♦numnow = ( (n - 2) * numcd) - 1; 
* da ta — division = -1; 
*code_channel = n - 3 ; 
break ,- 

} 

*code_division = numcd - 1; 
* matsize = m * numdd; 
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for (i = 0; i < numdd; i++) { 
ctot[i] = -m; 

for (j = 0; j < n - 2; j++) c[(j * numdd) + i] = 0; 

for (i = 0; i < numcd; i++) ccdtot[i] = 0; 

for (i = 0; i < ( (n - 1) * numcd); i++) num[i] = (savetime[i] 



« 0); 



} 



int gnslow(num, numnow, n, numdd, numcd) 
long int num [MATROWSINMAX] ; 
int *numnow, n, numdd, numcd; 



{ 



♦numnow = 0; 

for ( ; ; ) { /* Loop forever 

if (num[*numnow] != (l<<numdd) -1) { 
++num[* numnow] ; 
if (*numnow==0) return (0) 
else num [ - - * numnow] = -1; 

} 

else { 

if ( * numnow == ( (n - 1) * 
else ++*numnow; 

} 



/* If num is not too high */ 



} 



} 



/* Update num 
/* If this is the lowe.st numnow, return 
/* Else this is not lowest numnow, so update 
/* End If num is not too high 

/* Else num is too high 
* numcd) - 1) return(l); /* Final return 

/* Go to a higher numnow 
/* End Else data division is too high 
/* End of Loop forever 
/* End of main 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 



int gn2fast(num, code_channel , code_division, data_division, c, ctot, numnow, 

log2num, save time, n, m, numdd, numcd) 
long int num [MATROWSINMAX] ; 

int *code_channel, *code_division, *data_division, *numnow; 

int savetime [MATROWSINMAX] , C [ (NMAX-2) *NUMDDMAX] , Ctot [NUMDDMAX] ; 

int log2num [MATROWSINMAX] , n, m, numdd, numcd; 



{ 



if (*data_divisiqn >= 0) { 

- -c [ ( *code_channel * numdd) + 
--ctot [*data_division) ; 

} 



*data_ division] ; /* Decrement 

/* Decrement ctot 



for(;;) { /* Loop forever 

if (++*data_division < numdd) { /* If new data division is not too high 
/* If current position is OK, mark it as used and possibly do updates 
if ( (savetime [* numnow] ==0) || (ctot [*data_divis ion] 1= -m) ) { 
if (ctot [*data_division] == -m) savetime [* numnow] = 1; 
/* If current choice meets tests, do updates 
if ( (ctot [*data_division] < 0) 
&& (c [ (*code_channel * numdd) + *data_division] ==0) ) { 
num[*numnow] = 1 << *data_division; /* Update num 

log2num [*numnow] = *data_division + 1; /* Update log2num 

++c [ (*code_channel * numdd) + *data_division] ; /* Increment c 

++ctot [*data__division] ; /* Increment ctot 

if (*numnow«c0) return(O);/* If this is the lowest numnow, return 
else { /* Else this is not the lowest numnow 

--♦numnow; /* Decrement numnow 



*/ 
*/ 



*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
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*data_division = -1; /* Adjust to new data division 

savetime [*numnow] = 0; /*Reset to virgin position not yet found 
if < --*code_division < 0) { /* If new code division is too low 



} 



} 



} 



*code__division = numcd - 1; 
- - *code_channel ; 



/* Reset code division 
/* Decrement code channel 
/* End If code division is too low 
/* End Else this is not lowest numnow 
/* End If current choice meets tests 
/* End If current position is OK 
End If data division is not too high 

/* Else data division is too high 
numcd) /* If everything checked 

/* Final return 
/* Go to a higher numnow 
*data_division=log2num[++*numnow] -1; /* Update data_div. & numnow 
if (++*code_division==numcd) { /* If new code division is too high 



} 

} 

else 

if (* numnow 
return (1) ; 
else { 



{ 



/* 

(n - 3) * 



*code_division = 0; 
++*code_channel ; 

} 

--c [ (*code_ channel * numdd) 
--ctot [*data_division] ; 



} 



/* Reset code division 
/* Increment code channel 
/* End If new code division is too high 
k *data_divisionJ ; . /* Decrement c 

/* Decrement ctot 
'* End If this is not: the highest numnow 
/* End Else data division is too high 
/* End of Loop forever 
/* End of main 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
•/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
•/ 
*/ 



int gn3fast(num, code_channel , code_division, data_division < c, ctot, numnow. 

log2num, savetime, n, m, numdd, numcd, ctottot, ccdtot) 
long int num [MATROWSINMAX] ; 

int *code_channel , *code_division, *data_division, *numnow; 

int savetime [MATROWSINMAX] , C [ (NMAX-2) *NUMDDMAX] , Ctot [NUMDDMAX] ; 

int log2num [MATROWSINMAX] , n, m, numdd, numcd, *ctottot, ccdtot [NUMCDMAX] ; 



{ 



int i, nmmax; 

nmmax = (m > (n - m) ? (m) 



(n - m) ) ; 



if 



} 



(*data_division >= 0) { 
--c t (*code_channel * numdd) + *data_division] ; /* Decrement c 

--ctot [*data_division] ; . /* Decrement ctot 

--ccdtot [*code_division] ; 
--♦ctottot; 



numdd | | savetime [* numnow] ==1) 



for ( ; ; ) { 

if (++*data_division 

if (* numnow == (n - 1) * numcd - 1) 

return (1) ; 
else { 

*data_division=log2num [++*numnow] ; 
if (++*code_division==numcd) { / 

*code_division = 0; 

+ + * c ode channe 1 ; 



/* Loop forever 
{ /* If need higher 



/* If everything checked 
/* Final return 
/* Go to a higher numnow 
/* Update data_div. & numnow 
If new code division is too high 
/* Reset code division 
/* Increment code channel 



*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 



} /* End If new code division is too high 

if (*data_ division >= 0) { 

--c [ (*code_channel * numdd) + *data_division] ; /* Decrement c */ 



c- 



119 



BNSDOCID; <WO. 



0110043A1_I_> 



WO 01/10043 



PCT/US00/20398 



} 



--ctot [*data_division] ; 
--ccdtot [*code_division] ; 
--♦ctottot; 



/* Decrement ctot */ 



*/ 
*/ 
*/ 



} /* End If this is not the highest numnow 

/* End If data division is too high 
se { /* if new data division is not too high 

/* If current position is OK, mark it as used and possibly do updates 
if (*data_division<0 | | savetime [*numnow] ==0 | | ctot [*data_division] !=-m) 

if (*data_division>=0 &&ctot [*data_division] ==-m) savetime [*numnow] « 1; 

/* if current choice meets tests, do updates */ 
if (*data_division < 0 || ( (ctot [*data_division] < 0) 
ScSc (c[{*code__channel * numdd) + *data_division] ==0 ) ) ) { 



{ 



>> 1; 



/* Update num 
Update log2num 



*/ 
*/ 



/* Increment c 
Increment ctot 



num[*numnow] = (1 << ( *data_division + 1)) 
log2num [*numnow] = *data_division; 
if (*data_division >= 0) { 

++c[(*code_channel * numdd) + *data_division] ; 
++ctot [*data_division] ; 
++ccdtot [*code_division] ; 
++*ctottot; 

} 

if ( * numnow ! =0 ) { 

*data_division = -2; /* Adjust to new data division 

savetime t--*numnow] =0; /* Indicate unused position not yet found 
if (--*code_division < 0) { /* If new code division is too low 
*code_division = numcd - 1; /* Reset code division 

--*code_channel; /* Decrement code channel 

if (*ctottot < (numdd * iri * (nmmax - 1 - *code_channel ) /nmmax) ) 

savetime I* numnow] = 1; 
for (i = numcd - 1; i >= 0; i- - ) 
if ((ccdtot[i] > m* numdd/ numcd) || 
(ccdtotli] < m*numdd/numcd - 1 - *code_channel) ) 
savetime [* numnow] = 1; 

/* End If code division is too low */ 
/* End If this is not lowest numnow */ 

i--) if (ccdtot[i] > m*numdd/numcd) { 



*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
*/ 



} 



} 



else if (*ctottot == numdd *m) 
for (i = numcd - 1; i >= 0; 
savetime [* numnow] = 1 ; 
break; 

} 

if (i < 0) return (0) ; 

else if (*data_division >= 0) 

- -c [ ( *code_channel * numdd) 

--ctot t*data_division] ; 

--ccdtot [*code_division] ; 

--*ctottot; 



/* return for now */ 

{ 

+ *data_divieion] ; /* Decrement c 
/* Decrement ctot 



*/ 
*/ 



} 

} 

els 



e if (*data_division >= 0) 
-c [ (*code_channel * numdd) 
-ctot [*data_division] ; 
-ccdtot [*code_division] ; 
-*ctottot; 



{ 

+ *data division] ; 



/* Decrement c */ 
Decrement ctot */ 



/* End If current choice meets tests */ 
/* End If current position is OK */ 
/* End Else data division is not too high */ 
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} /* End of Loop forever */ 

} /* End of main */ 

makemat (num, channel_pattern, mat, matnow, n, m, numdd, numcd) 
long int num [MATROWSINMAX] ; 
unsigned long int mat [MATSIZEMAX] ; 

int channel_pattern[NUMMATMAX*NMAX] , matnow, n, m, numdd, numcd; 
{ 

int i, j, k, p, x; 
unsigned long int y; 

for (i = 0; i < (n - m) ; i++) { 

x = channel pattern [ (matnow * n) + i] ; 
for (j =0; j < numcd; j++) { 
y = 0; 

for (p = (matnow * n) + (n - m) ; p < (matnow * n) + n; p++) { 
k = j + (n-l)* numcd + (numcd * (x - channel ^pattern tp] ) ) ; 
while (k < 0) k += (n * numcd) ; 
while (k >= (n * numcd)) k -= (n * numcd) ; 
y = (y << numdd) | num [k] ; 

} 

mat [ (i*numcd) +j] = y; 

} 

} 

return; 

} 

int gausstest (mat , matsize) 
unsigned long int mat [MATSIZEMAX] ; 
int matsize; 
{ 

int i , j , k ; 
unsigned long int r; 

for (i=k=0, r = 1 << (matsize-1) ; i < matsize; k = ++i, r>>=l) { /* Columns */ 
if (mat[i] < r) { /* If diagonal point != l, find row with 1 in column */ 
for (k = i + 1; (k < matsize) && (mat [k] < r) ; k++) ; /* Looking */ 

if (k « matsize) return (1) ; /* Return if all zeroes in a column */ 

mat[k] mat[i] A = mat [k] ; /* Modulo-2 row switch! */ 

} /* after this statement, diagonal point is guaranteed to be a 1 */ 

for (j=0; j<i; j++) if ( (mat [j] &r) 1= 0) mat [ j ] ^= mat [i] ; /* Clean col */ 
for (j = k+1; j < matsize; j++) if (mat[j] >= r) mat[j] A - mat[i]; /* ■ */ 
j /* End of going through columns */ 

return (0) ; . 
} /* End of gaussian elimination test for a modulo-2 -invert able matrix */ 

gauss (mat, matinv, matsize, matnow) 

unsigned long int mat (MATSIZEMAX] , matinv [MATSIZEMAX] ; 
int matsize, matnow; 

{ 

int i, j, k; 
unsigned long int r; 

printf ( M \nCase %i original matrix\n" , (matnow + 1) ) ; 
for (i = 0; i < matsize; i++) { 
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for (j = matsize - 1; j >- 0; j--) printf («%i (mat [i] & (1 « j ) ) ? 1 : 

0) ; 

print f <"\n") ; 

} 

for (i = 0, r = 1 << (matsize-1) ; i < matsize; i + +, r >>« 1 ) matinv[i] = r ; 

for (i=k=0, r = 1 << (matsize-1); i < matsize; k = ++i, r>>=l) { /* Columns */ 

if (matti] < r) { /* If diagonal point != 1, find row with 1 in column */ 

for (k = i + 1; (k < matsize) && (mat [k) < r) ; k++) ; /* Looking */ 

if (k ~ matsize) return; /* Return if all zeroes in a column */ 

mat[k] A = matti] mat [k] ; /* Modulo-2 row switch! */ 

matinvfk] matinvti] A = matinvtk]; /* Switch rows in shadow matrix */ 

} /* After this statement, diagonal point is guaranteed to be a 1 */ 

for (j=0; j<i; j++) if ((mat[j] & r) != 0) { /* Rows above the diagonal */ 

mattj] A = mat[i]; /* Clean column to get rid of 1, except diagonal */ 

matinvfj] A = matinv[i]; /* Corresponding operation in shadow matrix */ 

for (j = k+1; j < matsize; j++) if (mat[j] >= r) { /* Rows below diagonal*/ 
mat[j] A = matti]; /* Clean column to get rid of .1, except diagonal */ 

matinvtj] matinvti]; /* Corresponding operation in shadow matrix */ 

} /* End of going through columns */ 

printf ( "\nCase %i inverted matrix\n", (matnow + 1) ) ; 
for (i =0; i < matsize; i++) { 

for (j = matsize-1; j >= 0; j--) printf ("%i " , (matinv [i ] & ( 1<< j ) ) ? 1 : 0) ; 

printf ("\n") ; 

} 

return; 

} /* End of gaussian elimination method to invert a 0-1 matrix in modulo-2 */ 

int showansw (showtype, num, tot sol, tot it, matnow, n, m, numdd, numcd, nummat, 
tdiv) 

long int num tMATROWSINMAX] ; 

int showtype, totsol, matnow, n, m, numdd, numcd, nummat, tdiv; 
unsigned long int totit; 

{ 

int i, j, k, p, x; 

if (showtype 1= 0) { 

printf ("\nn = %i, m = %i, d = %i, c = %i, " , n, m, numdd, numcd); 
printf ("p = %i, cases = %i", tdiv, nummat); 

} 

if (showtype > 0) prirrtf ( "\n\nSolution %i has been found:", totsol); 

if (showtype < 0) printf (" \n\nThe last pattern checked was:"); 

printf ("\n%lu %i: M , totit, matnow); 

for (i = 0; i < (n - 1); i++) { 

for (j = 0; j < numcd; j++) printf ("%lu num [ (i*numcd) + j ] ) ; 

printf C "); 

} 
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printf ("\n") ; 

if (showtype > 0) { 
printf ("\n") ,- 

for (i = p = 0; i < (n - 1) ; i++) 
for (j = 0; j < numcd; j++) { 

((j==0) ? printf (" ) : printf <" ")); 

for (k = (numdd - 1) ; k > = 0; k--) { 

p += x = ((num[(i * numcd) + j] 6 (1 « k) ) ? 1 : 0) ; 
printf ("%i", x) ; 

} 

((j==0) ? printf (" -\n") : printf ( "\n" >) ; 

} 

printf ( M \nSolution %i contains %i ones; «, totsol, p) ; 

printf ("the lower theoretical bound is %i ones\n\n", m* numdd ) ; 

} 

if (showtype < 0) switch (totsol) { 

case 0: printf ("\nNo solutions were found\n" ); break; 

case l: printf ("\nA total of 1 solution was found\n") ; break; 

default: printf ("\nA total of %i solutions were found\n", totsol); 

} 

} 
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APPENDIX D 
DUALITY THEOREM for Wiencko codes 



The following is an additional technique for finding Wiencko arrays and codes 
for those various applications. The "duality" to be described gives a one to one 
relationship between(n, m) Wiencko arrays and (n, n-m) Wiencko arrays. This 
means if you have an (n, m) Wiencko code, you can automatically get an (n, n- 
m) Wiencko code, and vice versa. 

DUALITY 

Let W be a j by k array of H by Q matrices. We define the "transpose" Wt of W 
as the k by j array of Q by H matrices such that, for every z, w such that 0<=z<j 
and 0<=w<k, 

t 

(i) wt = (w ) 

wz zw 

It follows trivially from this that the equivalent matrix to Wt is the transpose of 
the equivalent matrix to W. In discussions below, we will let z'w' be the array 
subscripts of Wt, so that z'=w and w'=z in (1). 

THEOREM (Duality): 

W is an (n, m) Wiencko array if and only if Wt is an (n, n-m) Wiencko array. 
Proof: 

It suffices to prove it in one direction (assuming W is Wiencko and proving Wt 
is), since exchanging m for n-m in the proof then proves it in the other 
direction. 
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It is trivial to show that if H and Q satisfy the test n/m = (H/Q)+l, then H'=Q 
and Q -H satisfy the test for (n, m') where m-n-m. 

All that remains, therefore, is to prove that the "invertibility* conditions are 
satisfied. But there is a one to one correspondence between the subsets S of m 
disks out of n, and the subsets S' of m'=n-m disks out of n: let S' be all the 
disks not in S. The subarray of W corresponding to S consists of all matrix 
entries with z in S' and w in S. But the subarray of Wt corresponding to S' 
consists of all matrix entries with z' in S and w' in S\ By (1) these correspond 
exactly, but transposed; so that the equivalent matrix to the subarray of Wt 
corresponding to S* is the transpose of the equivalent matrix to the subarray of 
W corresponding to S. 

If a matrix is invertible, so is its transpose. Since this holds for all possible 
values of S', it is proven that Wt satisfies the invertibility conditions. This 
completes the proof. 

Corollary: 

The one to one duality relation maps zero diagonal (n, m) Wiencko arrays into 
zero diagonal (n, n-m) Wiencko arrays, and maps rotationally symmetric (n, m) 
Wiencko arrays into rotationally symmetric (n, n-m) Wiencko arrays. 



Proof: 



Trivial consequence of the definitions. The final step is to describe how to 
exchange the standard descriptions. If the standard description of an (n,m) 



Wiencko code is 



Ml 
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M2 



M(n-l) 



where the M's are H by Q bit matrices, then it trivially follows that the standard 
description of its dual is 



M(n-l) 



t 

M2 

t 

Ml 



EXAMPLE 

The (9,4) solution has the following standard description: 



9 4 5 4 
01000 - 
00001 
10000 
00010 
00000 - 
00000 
00001 
10000 

00010 - 
01000 
00100 
10000 
00010 - 
10000 
00100 
01000 
01000 - 
00100 
00000 
00001 
00100 - 



t 
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00010 

00001 

00000 

00000 - 

00000 

00000 

00000 

00000 - 

00000 

00000 

00000 

By applying the duality, the following (9,5) code can be found: 

9 5 4 5 
0000 - 
0000 
0000 
0000 
0000 
0000 - 
0000 
0000 
0000 
0000 

oooo - 

0000 
1000 
0100 
0010 

0000 - 
1000 
0100 
0000 
0001 
0100 - 
0001 
0010 
1000 
0000 

0001 - 
0100 
0010 
1000 
0000 
0001 - 
0000 
0000 
0000 
0010 

0010 - 
1000 
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0000 
0001 
0100 



This code was confirmed as a Wiencko code. Other lacunae that can be filled by 
this method are (10,7), (13,11), (15,13), (16,14), (17,15), (18,16) and (18,15). 
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/*{{{ includes*/ 

/* #include <sys/types .h> 

#include <sys/stat.h> 

tfinclude <fcntl.h> 

#include <termios.h> */ 
#include <stdio.h> 
^include <stdlib.h> 

/*}}} •/ 

/*{{{ defines*/ 

tfdefine DEBUG 1 
tfdefine MAX_JDIM 100 
#define MAX_IDIM 306 
#define MAX_LENGTH 8192 
fldefine MAX_COUNT 1024 
tfdefine FILENAME "file. joe" 
#define FILEOUT "file.ljd" 

/*}}} */ 

main(int argc, char *argv[]) 
{ 

FILE *stream; 

char textline[512] , filename [128] , rotated [10] , f ileout [128] ; 
char joefile[MAX_JDIM] [MAXJEDIM] ; 

int n, m, D, P, datalines , i , j , x, y , z , ndeads , alive , ndim, mtemp , mf lip ; 
/* START OF THE STATEMENTS */ 
if (argol) { 

strcpy (f ilename,argv [1] ) ; 
} else { 

strcpy (filename, FILENAME) ; 

} 

if (argc>2) { 

strcpy (f ileout, argv [2] ) ; 
} else { 

strcpy (f ileout, FILEOUT) ; 

printf ("\n\nTEST PROGRAM dualize READS %s . . . \n\n\n" , filename); 
alive = 1; 

/*{{{ build the D to P matrix*/ 

stream = fopen( filename, ••r") ; 
if (stream«=NULL) { 

printf ("%s does not exist . \n" , filename) ; 

exit(l); 

printf ("%s exists. Reading in its data. \n" , filename) ; 
if (f gets (text line, 511, stream) !=NULL) { 

sscanf( text line, "%d %d %d %d" , &n, &m, &D, &P) ; 

printf ( n n=%d ( m=%d , D=%d , P=%d\n" , n , m, D , P) ; 

if (n&Scm&&D&&P&& (n>m) && ( (n*P) == ( (D+P) *m) ) ) { 
printf ("Confirmed correct ratio. \n M ); 

} else {printf ("Error in ratio or zero entry . \n" ) ;exit (1) ; } 

} 

mtemp « n-m; f 
if ( (n>18) | | ( (m>5) &&(mtemp>5) ) | | ( (n*P) >MAX_JDIM) | | ( (n*D) >MAX_IDIM) ) { 

printf ("Horsefeathers! Too big for our buffers I ixj «=%dx%d\n M , n*D, n*P) ; 

if ( (n>18) | | ( (m>5) &&(mtemp>5) ) ) exit(l); 

alive = 0; 

V>-(o 
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} 

if (alive) { 

/*{{{ read in and check the data*/ 

memset ( joef ile, (int) '0«, MAX_JDIM*MAX_IDIM) ; 

datalines = 0; 

memset (text line, (int) ' 7 • ,D) ; 
while (f gets (textline, 511, stream) !«=NULL) { 
print f (" >>>%s" , textline) ; 
for (x=0,*x<D;x++) { 

if ((((int)textline[x])-48)>>l) { 

printf ("Illegal character in data line %d . \n" , datalines) ; 
alive = 0; 

} ) 

memcpy (fcjoef ile [datalines+P] [0] , textline, D) ; 

memset (textline, (int) , 7 , ,D); 

datalines++; 

printf ("Total of %d lines of data .", datalines) ; 

if (datalines==((n-l)*P)) {printf (" Correct . \n" ) ; 

} else {printf (" Should have been %d . \n" , (n- 1) *P) ; alive = 0;} 

/*}}} */ 

} 

f close (stream) ; 
if (alive) { 

/*{{{ write the transposed data*/ 

• { 

int md = n - m; 

int Pd = D; 

int Dd = P; 

int ndisk, id, jd; 

stream = fopen( fileout, "w M ); 

if (stream==NUIjLi) { 

printf ("%s failed to open . \n" , fileout) ; 

exit(l); 

printf ("%s is open for writing . \n» , fileout) ; 
fprintf (stream, "%d %d %d %d\n" , n, md, Dd, Pd) ; 
for (ndisk=l;ndisk<n;ndisk++) { 

int nofs ■ (n-ndisk) *Dd; 

int i,j; 

for (j=0; j<Pd; j++) { 

for (i=0;i<Dd;i++) textline [i] = joef ile [nofs+i] [j ] ; 
textline [Dd] = 1 \0 • ; 

if (j) {fprintf (stream, "%s\n" , textline) ; 

} else {fprintf (stream, "%s -\n" , textline) ; } 

) 

} 

fclose (stream) ; 

printf ("%s is complete . \n" , fileout) ; 

} 

/*}}} */ 

} 

/*}}) */ 
exit (0) ; 

} 
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CLAIMS 



1 . A data storage apparatus, comprising: 
a plurality of n data storage devices; 

5 data comprising a plurality of n data groupings; 

each one of said n data groupings stored in one of said n data storage 
devices; 

each one of said n data groupings comprising a data portion and a data 
redundancy portion; and 
10 said n data portions being recoverable from any and all combinations of 

n-m data grouping(s) on n-m data storage device(s) when the other m data 
grouping(s) are unavailable, where 1 < m < n. 

2. A data storage apparatus according to claim 1, further comprising: 
15 a storage controller; and 

said storage controller operative to recover said n data portions from any 
and all combinations of n-m data grouping(s) on n-m disk(s) when the other m 
data grouping(s) are unavailable. 



20 3. A method for recovering data, comprising: 

receiving a plurality of n-m data grouping(s) of a plurality of n data 
groupings of data, where 1 < m < n, each n data grouping comprising a data 
portion and a redundancy portion; and 

recovering the n data groupings from the plurality of n-m data groupings 
25 when the other m data groupings are not available. 
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4, A disk array having n disks, comprising: 
a stripe set stored across the n disks; 

each stripe of the stripe set comprising a data portion of H data chunks 
and a redundancy portion of Q redundancy chunks; 
5 a relationship existing between the data portions and the redundancy 

portions; 

the relationship being based on an n*H by n*Q bit matrix, the bit matrix 
being such that: 

the bit matrix is representible by an n by n array of H by Q bit 
10 submatrices, where n/m = (H/Q)+l; 

the bit matrix has a plurality of n!/(m!*(n-m)!) composite bit 
submatrices definable therefrom, each such composite bit submatrix being 
definable from bit submatrices at the intersection of a unique selection of m 
column(s) of the n by n array and a unique selection of (n-m) row(s) of the n by 
15 n array that correspond to those (n-m) column(s) not included in the unique 
selection of m column(s), where each one of the composite submatrices is 
invertible; and 

the relationship between the data portions and the redundancy portions 
being such that each one of Q redundancy chunks is the exclusive-OR of the 
20 n*H bits of the data portions and the n*H bits in the row of the bit matrix 
associated with such redundancy bit. 
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5. An apparatus suitable for use in generating redundancy data, 
comprising: 

a storage medium; and 

codes embedded in said storage medium, said codes being representible 
5 by an n*H by n*Q bit matrix, the bit matrix being representible by an n by n 
array of H by Q bit submatrices, where n/m = (H/Q) + 1, the bit matrix having 
a plurality of n!/(m!*(n-m)!) composite submatrices definable therefrom, each 
composite submatrix being definable from submatrices at the intersection of a 
unique selection of m column(s) of the n by n array and a unique selection of 
10 (n-m) row(s) of the n by n array that correspond to those (n-m) column(s) not 
included in the unique selection of m column(s), each such composite 
submatrix being invertible. 
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6. A programmable gate array (PGA), comprising: 
logic gates configured to execute a set of XOR-based functions; 
the set of XOR-based functions being representable by an n*H by n*Q bit 
matrix, the bit matrix being such that: 
5 the bit matrix is representible by an n by n array of H by Q bit 

submatrices, where n/m = (H/Q) + 1; 

the bit matrix has a plurality of n!/(m!*(n-m)!) composite 
submatrices definable therefrom, each composite submatrix being definable 
from submatrices at the intersection of a unique selection of m column(s) of the 
10 array and a unique selection of (n-m) row(s) of the array that correspond to 

those (n-m) column(s) not included in the unique selection of m column(s); and 
each composite submatrix being invertible. 
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7. One or more data storage devices, comprising: 
a plurality of n storage location areas; 

each one of the n storage location areas having one of n data groupings 
stored thereat; 

5 each one of the n data groupings comprising a data portion of H data 

chunks and a redundancy portion of Q redundancy chunks, 

a relationship existing between the data portions and the redundancy 
portions; 

the relationship being based on an n*H by n*Q bit matrix, the bit matrix 
10 being such that: 

the bit matrix is representible by an n by n array of H by Q bit 
submatrices, where n/m = (H/Q) + 1; 

the bit matrix has a plurality of n!/(m!*(n-m)!) composite bit 
submatrices definable therefrom, each such composite bit submatrix being 
15 definable from bit submatrices at the intersection of a unique selection of m 

column(s) of the n by n array and a unique selection of (n-m) row(s) of the n by 
n array that correspond to those (n-m) column(s) not included in the unique 
selection of m column(s), where each one of the composite submatrices is 
invertible; and 

20 the relationship between the data portions and the redundancy portions 

being such that each one of Q redundancy chunks is the exclusive-OR of the 
n*H bits of the data portions and the n*H bits in the row of the bit matrix 
associated with such redundancy bit. 
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8. A redundancy data generation apparatus operative to execute one 
or more functions from a set of n redundancy data generation functions, 
comprising: 

the set of n redundancy data generation functions being representible by 
5 a bit matrix of dimensions n*H by n*Q; 

the bit matrix being representible by an n by n array of H by Q bit 
subma trices, where n/m = (H/Q) + 1; 

the bit matrix having a plurality of n!/(m!*(n-m)!) composite submatrices 
definable therefrom; 
10 each composite submatrix being definable from the intersection of a 

unique selection of m column(s) of the n by n array and a unique selection of 
(n-m) row(s) of the array that correspond to those (n-m) column(s) not included 
in the unique selection of m column(s); and 

each composite submatrix being invertible. 
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9. A method of generating redundancy data with a set of n 
redundancy data generation functions, the set of n redundancy data generation 
functions being representible by an n*H by n*Q bit matrix, the bit matrix being 
representible by an n by n array of H by Q bit submatrices, where n/m = (H/Q) 
5 +1, the bit matrix having a plurality of n!/(m!*(n-m)!) composite submatrices 
definable therefrom, each composite submatrix being definable from the 
intersection of a unique selection of m column(s) of the n by n array and a 
unique selection of (n-m) row(s) of the array that correspond to those (n-m) 
column(s) not included in the unique selection of m column(s), each composite 
10 submatrix being invertible, the method comprising: 

receiving a plurality of n data groupings; and 

executing the set of n data redundancy functions on the plurality of n 
data groupings to thereby generate a plurality of n redundancy data groupings 
associated therewith. 

15 
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10. A method for use in providing adjustable disk loss insurance, 
comprising: 

receiving control data indicative of an amount of disk redundancy 
desired for the disk array; 

selecting a set of data redundancy functions based on the control data; 

and 

generating redundancy data based on user data and the selected set of 
data redundancy functions. 



<WO 0110043A1_I_> 



138 



WO 01/10043 PCT/US00/20398 

11. A programmable gate array (PGA) operative to provide for 
adjustable data redundancy, comprising: 

logic gates configured to selectively execute one of at least a first set of 
XOR-based functions and a second set of XOR-based functions; 
5 the first set of XOR-based functions being representable by a first bit 

submatrix having dimensions of n*Hi by n*Qi, the first bit matrix being such 
that: 

the first bit matrix is representible by a first n by n array of Hi by 
Qi bit submatrices, where n/mi = (Hi/Qi) + 1, and 1 < mi < n; 

10 the first bit matrix has a first plurality of n!/(mi!*(n-mi)!) 

composite submatrices definable therefrom, each composite submatrix of the 
first plurality being definable from the intersection of a unique selection of mi 
column(s) of the first n by n array and a unique selection of (n-mi) row(s) of the 
first n by n array that correspond to those (n-mi) column(s) not included in the 

15 unique selection of mi column(s), where each composite submatrix of the first 
plurality is invertible; 

the second set of XOR-based functions being representable by a second 
bit matrix having dimensions of n*H2 by n*Q2, the second bit matrix being such 
that: 

20 the second bit matrix is representible by a second n by n array of 

H2 by Q2 bit submatrices, where n/m2 = (H2/Q2) + 1, and mi < m2 < n; and 
the second bit matrix has a second plurality of n!/(m2!*(n-m2)!) 
composite submatrices definable therefrom, each composite submatrix of the 
second plurality being definable from the intersection of a unique selection of 

25 m2 column(s) of the second n by n array and a unique selection of (n-m2) row(s) 
of the second n by n array that correspond to those (n-m2) column(s) not 
included in the unique selection of m2 column(s), where each composite 
submatrix of the second plurality is invertible. 
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12. A method for use in a radio frequency (RF) communication system, 
the method involving data comprising a plurality of n data groupings 
respectively associated with a plurality of n parity groupings, where n and m 
are integers and l<m<n, the method comprising: 

5 in a first communication device: 

modulating radio frequency (RF) signals with the data and 
providing transmission over a plurality of n channels, each channel being 
associated with one of the n data and parity groupings; 
in a second communication device: 
10 receiving and demodulating the modulated RF signals to 

regenerate the data, which may have errors from interference; 

determining whether and which n data groupings are adversely 
affected with errors; and 

recovering the data from (n-m) data groupings when the other m 
15 data groupings are determined to be adversely affected with errors. 

13. The method according to claim 12, wherein the plurality of n 
channels are defined by n different RF frequencies. 

20 14. The method according to claim 12, wherein the plurality of n 

channels are defined by n different time slots of a single RF frequency. 

15. The method according to claim 12, wherein the plurality of n 
channels are defined by n different code divisions. 
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16. A method for use in a computer network having at least n servers, 
the method involving data and parity spread across the n servers, where m and 
n are integers and l<m<n, the method comprising: 

sending a data request to the n servers; 
5 receiving a response from the first (n-m) of n servers; 

selecting a data recovery function based on which (n-m) of n servers first 
responded; and 

recovering the data based on the data and parity received from the (n-m) 
servers that first responded and the selected data recovery function. 

0 

17. The method according to claim 16, where m = n-1 
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18. A computer- implemented method for use in determining whether 
data representing a candidate bit matrix is a Wiencko bit matrix, the candidate 
bit matrix having dimensions of n*H by n*Q and being representible by an n by 
n array of H by Q submatrices, where n, m, H, and Q are positive integers, 1 < 
5 m < n, and n/m = (H/Q) + 1, the method comprising: 

selecting an H*m by Q*(n-m) composite submatrix from the candidate bit 
matrix, the composite submatrix formed by submatrices from the intersection 
of a unique selection of m column(s) of the array and a unique selection of (n- 
m) row(s) of the ^rray that correspond to those (n-m) column(s) not included in 
10 the unique selection of m column(s); 

determining whether the selected composite submatrix is invertible; 

repeating the selecting and testing as needed to determine whether all 
n!/(m!*(n-m)!) composite submatrices of the candidate bit matrix are invertible; 
and 

15 providing an indication as to whether or not all n!/(m!*(n-m)!) composite 

submatrices are invertible based on the selecting, testing, and repeating. 
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